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Preface 



This edited volume addresses problems in digital image geometry and topology, a 
special discipline of discrete mathematics. Images or discrete objects, to be ana- 
lyzed based on digital image data, need to be represented, analyzed, transformed, 
recovered etc. These problems have stimulated many interesting developments in 
theoretical fundamentals of image analysis, and this volume contains reviewing 
articles as well as more specialized contributions on theoretical (basic) or more 
applied aspects of 

— digital topology: axiomatic foundation of digital topology (E. Dominguez 
and A.R. Frances), programming, algorithmic, and data structure issues on 
cell complexes (U. Kothe and V. Kovalevsky), cellular representations of 3- 
manifolds (S. Matveev), and fixed points in discrete mappings (R. Tsaur and 
M.B. Smyth), 

— representation of images and objects: description of a spatial modeling tool 
(E. Andres, R. Breton, and P. Lienhardt), reviews on hierarchical image 
representations (L. Brun and W. Kropatsch) and simplicial multi-complexes 
for object modeling (E. Danovaro, L. De Floriani, P. Magillo, and E. Puppo), 
polyhedral 3D surface representations based on combinatorial topology (Y. 
Kenmochi and A. Imiya), and encodings of digital partitions (J. Zunic), 

— digital geometry: discrete object reconstruction (A. Alpers, P. Gritzmann, 
and L. Thorens), Hough transform (P. Bhattacharya, A. Rosenfeld, and 
I. Weiss), a review on digital lines and digital convexity (U. Eckhardt), dig- 
ital curvature (A. Imiya), Hausdorff digitization (C. Ronse and M. Tajine), 
and cellular convexity (J. Webster), 

— multigrid convergence: length measurements of curves in 3D digital space 
via minimum-length polygons (T. Biilow, R. Klette) and via digital straight 
segments (D. Coeurjolly, I. Debled-Renesson, and O. Teytaud), a review on 
multigrid-convergent feature measurement (R. Klette), 2D curve approxi- 
mation for length estimation (L. Noakes, R. Kozera, and R. Klette), and 
use of the relative convex hull for surface area estimation (F. Sloboda and 
B. Zatko), 

— shape similarity and simplification: rotation-invariant similarity of convex 
polyhedra (J.B.T.M. Roerdink and H. Bekker), surface skeletons of 3D ob- 
jects (S. Svensson), a review on 3D skeletonization (J-I. Toriwaki and K. 
Mori), iterated morphological operations (J. Van Horebeek and E. Tapia- 
Rodriguez), and optimal grouping of collinear line segments (P. Veelaert). 

These are the five parts of the volume, each containing five or six chapters, 
reflecting major developments in recent digital geometry and topology. 

The volume presents extended and updated versions of 27 talks given at 
the winterschool “Digital and Image Geometry” (December 18 - December 22, 
2000, Schloss Dagstuhl, Germany). The editors thank all the reviewers for their 
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An Axiomatic Approach to Digital Topology * 



Eladio Dominguez and Angel R. Frances 

Dpt. de Informatica e Ingenieria de Sistemas. Facultad de Ciencias 
Universidad de Zaragoza, E-50009 - Zaragoza, Spain 
cciaSposta . unizar . es 



Abstract. In a series of papers the authors have developed an approach 
to Digital Topology, which is based on a multilevel architecture. One of 
the foundations of this approach is an axiomatic definition of the notion 
of digital space. In this paper we relate this approach with several other 
approaches to Digital Topology appeared in literature through a deep 
analysis of the axioms involved in the definition of digital space. 

Keywords: digital space, lighting function 



1 Introduction 

The main purpose of Digital Topology is to study the topological properties 
of digital images. Several different approaches to Digital Topology have been 
proposed in literature. Let us recall Rosenfeld’s graph-based approach [12] (see 
also [7], [8]), Khalimsky’s approach based in a particular topological space [6], 
and Kovalevsky’s approach based on abstract cell complexes [10]. 

Digital images are discrete objects in nature, but they are usually represent- 
ing continuous objects or, at least, they are perceived as continuous objects by 
observers. In a series of papers ([l]-[4]), the authors have developed a framework 
for Digital Topology that explicitly takes this fact into account and, in addition, 
tries to unify the previous approaches. The two foundations of our approach 
are a multilevel architecture and an axiomatic definition of the notion of digital 
space. The multilevel architecture bridges the gap between the discrete world 
of digital objects and the Euclidean world of their continuous interpretations. 
On the other hand, a digital space fixes, among all the possible continuous in- 
terpretations, just one for each digital object. This continuous interpretation of 
a digital object is represented at each level of the architecture using a differ- 
ent model; in particular, the corresponding model at the continuous level is an 
Euclidean polyhedron, called the continuous analogue of the object. 

Our approach has allowed us to introduce, in a very natural way, topolog- 
ical notions which are generalizing the corresponding notions given in other 
approaches. For example, we say that a digital object is connected (a digital 
manifold) if its continuous analogue is a connected polyhedron (a combinatorial 

* This work has been partially supported by the projects DGES PB96-1374 and DGES 
TIG2000-1368-C03-01. 
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manifold, respectively); and we have showed that, for suitable digital spaces, this 
notion of connectivity coincides with the (a, /3)-connectivity defined on the grid 
7Z^ , for a,[3 & {6,18,26} and {a, (3) yf (6,6), within the graph-theoretical ap- 
proach. Moreover, the digital 2-manifolds (i.e., digital surfaces) in those digital 
spaces are the (a, /3)-surfaces ([7]). And similarly, our notion of digital manifold 
generalizes also the strong 26-surfaces defined by Bertrand and Malgouyres [5]. 
In addition, the bridge provided by the multilevel architecture gives us a general 
method to obtain results in Digital Topology by translating the corresponding 
results from continuous topology. We have obtained using this method general 
digital versions of the Jordan-Brouwer ([2]) and Index Theorems ([3]) for our 
digital manifolds, which are generalizing the well-known result for (26,6)- and 
(6,26)-surfaces due to Morgenthaler and Rosenfeld [11]. Recently, we have also 
introduced a notion of digital fundamental group for arbitrary digital spaces and 
obtained a digital version of the Seifert-Van Kampen Theorem [4]. 

As it was quoted above, one of the foundations of our approach is the notion of 
digital space. A digital space consists of a polyhedral complex K and a function 
that associates each digital object with a set of cells of K from which it is derived 
the continuous analogue of the object. That function must satisfy a certain set 
of axioms intended for limiting the set of possible continuous analogues to those 
which are not contradictory with the natural perception of objects. 

The main goal of this paper is to analyze these axioms and, along the stages 
of this analysis, to relate our framework with some other approaches appeared 
in literature. 



2 Our Approach to Digital Topology 

In this section we briefly summarize the basic notions of our approach to Digital 
Topology as introduced in [2], as well as the notation that will be used in this 
paper. 

We represent the spatial layout of the image elements (that we will call pixels 
in 2D, voxels in 3D and xels in nD) of a digital image by a deviee model, which 
is a homogeneously n-dimensional locally finite polyhedral complex K . Only 
the n-cells in a device model K are representing xels, while the other lower 
dimensional cells in K are used to describe how the xels could be linked to each 
other. In this way, the objects displayed in digital images are subsets of the 
set cell„(Rr) of n-cells in a device model K] and, thus, we say that a subset 
O C cell„(Rl) is a digital object in K. 

By a homegeneously n-dimensional locally finite polyhedral complex we mean 
a set K of polytopes, in some Euclidean space K'^, provided with the natural 
reflexive, antisimetric and transitive binary relationship “to be face of” , that in 
addition satisfies the four following properties: 

1. li a G K and r is a face of tr then t G K. 

2. If a,T G K then cr n r is a face of both cr and r. 
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3. For each point x in the underlying polyhedron |A"| = Ujcr; cr € AT} of AT, 
there exists a neigbourhood of x which intersects only a finite number of 
polytopes in K\ in particular, each polytope of K is face of a finite number 
of other poly topes in AT. 

4. Each poly tope tr G A' is face of some n-dimensional poly tope in K. 

These complexes are particular cases of cellular complexes, as they are usually 
defined in polyhedral topology. So, in this paper, polytopes in K will be sim- 
ply referred to as cells, and K itself will be called a polyhedral complex. Next 
paragraph recalls some elementary notions from polyhedral topology used in this 
paper. We refer to [13] for further notions on this subject. 

Given a polyhedral complex K and two cells 7, cr G AT, we shall write 7 < cr 
if 7 is a face of tr, and 7 < tr if in addition 7 ct. A centroid-map on A' is a map 
c: K ^ \ K\ such that c(cr) belongs to the interior of ct; that is, c{a) Git= a — da, 
where da = U{ 7 ; 7 < ct} stands for the boundary of a. 

Remark 1. Most examples in this paper are given on the device model A", 
termed the standard cubical decomposition of the Euclidean n-space M". The 
device model A" is the complex determined by the collection of unit n-cubes in 
M" whose edges are parallel to the coordinate axes and whose centers are in the 
set Zi” . The centroid-map we will consider in A" associates each cube a with its 
barycenter c{a), which is a point in the set Z”. Here, Z = stands for the set 
of points {z G M; z = y/2, y G Z}. In particular, if dimer = n then c(cr) G Z", 
where dim a denotes the dimension of a; and, thus, every digital object O in A" 
can be identified with a subset of points in Z". 

The topological properties of a digital image depend on the “continuous 
interpretation” that an observer makes of the object displayed in that image. The 
selection of such a continuous interpretation for each digital object is represented 
by a lighting function defined on the device model AT. In the definition of lighting 
functions we use the following notions, which are illustrated in Fig. 1 for an object 
in the device model A^. 




Fig. 1. The support of an object O and two types of digital neighbourhoods 
in O for a cell a. The cells in O together with the bold edges and dots are the 
elements in supp(O) 
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The first two notions formalize two types of “digital neighbourhoods” of a 
cell a e itr in a given digital object O C cell„(AT). Indeed, the star of a in O 
is the set st„(o;; O) = {a € O; a < a} of n-cells (xels) in O having a as a face. 
Similarly, the extended star of a in O is the set st* (a; O) = {cr € O; a H cr yf 0} 
of n-cells (xels) in O intersecting a. 

The third notion is the support of a digital object O which is defined as 
the set supp(O) of cells of K (not necessarily xels) that are the intersection of 
n-cells (xels) in O. Namely, a G supp(O) if and only if a = n{a; cr € st„(o;; O)}. 
In particular, if a is a pixel in O then a G supp(O). Notice also that, among all 
the lower dimensional cells of K, only those in supp(O) are joining pixels in O. 

To ease the writing, we use the following notation: supp(itr) = supp(cell„(Ar)), 
stn{a;K) = st„(o;; cell„(AT)) and st*(a; AT) = st* (a; cell„(AT)). Finally, we shall 
write V{A) for the family of all subsets of a given set A. 

Definition 1. Given a device model K, a lighting function on K is a map 
f : V{celln{K)) x K ^ {0, 1} satisfying the following five axioms for all O G 
V{celln{K)) and a G K: 

(1) object axiom.- if dim a = dim AT, then f{0,a) = I if and only if a € O; 

(2) support axiom.- if a ^ supp{0) then f{0,a) = 0; 

(3) weak monotone axiom.- f{0,a) < f {cell„{K) , a) ; 

(4) weak local axiom.- f{0,a) = /(st* (a; O), a); and, 

(5) complement’s connectivity axiom.- if O' C O C celln{K) and a € K are such 
that stn{or,0) = st„{a;0'), f{0',a) = 0 and f{0,a) = 1, then: (a) the set 
of cells a{0'; O) = {P < a; f{O',0) = 0, /(0,/3) = 1} is not empty; (b) the 

o 

set U{/3;/3 G a(0';0)} is connected in da; and, (c) if O C O C celln{K), 
then f{0,P) = 1 for every /3 G a(0'; O). 

If f{0,a) = 1 we say that f lights the cell a for the object O; and the set 
Lb{0) = {a G AT; f{0,a) = 1} consisting of all cells in K that f lights for O is 
said to be the light body ofO. 

So, according to the remarks above, we introduce the following 

Definition 2. A digital space is a pair {K, f), where K is a device model and f 
is a lighting function on K. That is, a model for the spatial layout of xels and a 
continuous interpretation for every possible digital image drawn using these xels. 

A lighting function may also satisfy some of the two following axioms: 

(a) strong monotone axiom: f{Oi,a) < /(C> 2 ,a) for any pair of digital ob- 
jects Oi C C >2 C cell„(iF) and any cell a G AT; and, 

(b) strong local axiom: f{0,a) = f{st„{a;0),a) for every O C cell„(AT) and 
a € K. 

Definition 3. A lighting function f is said to be strongly monotone ( strongly 
localj if it satisfies the strong monotone (respectively, strong local) axiom; oth- 
erwise, f is called weakly monotone ('weakly local, respect.). 
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The term lighting function was introduced for the first time in [1] to refer 
what we call here a strongly local lighting function. While, in [2], the general no- 
tion of lighting function given in Definition 1 was termed weak lighting function. 
The original set of axioms in [1] and [2] involved the following property 

(I) for all O C cell„(AT) and a £ K, f{0, a) = 1 whenever a £ O 

instead of axiom (1) in Def. 1. Notice, however, that the conjunction of axioms 
(l)-|-(2) is equivalent to (I)-|-(2). Observe also that the strong local and strong 
monotone axioms imply the corresponding weak axioms; and, in addition, the 
strong local axiom also implies axiom (5) in Definition 1. Apart from these 
relations, it is not hard to find examples showing that axioms (l)-(5) in Def. 1 
and the strong monotone and local axioms are independent; see Section 4. 

We will also use the following notion in this paper. 

Definition 4. Let K be a device model. A light body map on K is a mapping 
Lb : V{celln{K)) — > V{K) that assigns a subset of cells of K to each digital 
object O C celln{K). 

Remark 2. Notice that, each lighting function on a given device model K defines 
a light body map on K. Moreover, the axioms given in Def. 1 can be rewritten 
in terms of the light body of objects as follows 

(!’) Object axiom: Lb(0) n cell„(iT) = O. 

(2’) Support axiom: Lb(0) C supp(O). 

(3’) Weak monotone axiom: Lb(0) C Lb(cell„(AT)). 

(4’) Weak local axiom: a £ Lb(0) if and only if a S Lb(st* (a; O)). 

(S’) Complement’s connectivity axiom: If C O and a £ Lb(0)— Lb(O') is such 
that stn(a;0) = stn(a;0'), then a{0';0) = daH (Lb(0) — Lb(O')) yf 0, 

the set Ll{P; P £ a{0';0)} is connected in da, and a{0';0) C Lb(0) for 
any digital object O A O. 

(a’) Strong monotone axiom: if 0\ C O 2 then Lb(Oi) C Lb(C? 2 )- 
(b’) Strong local axiom: a £ Lb(0) if and only if a G Lb(st„(a; O)). 

Henceforth, we will identify a lighting function with the corresponding light body 
map without further comment. 

Given a digital space (AT,/), any digital object O in K can be naturally 
associated with the subspace |Lb(0)| = Ll{a;a £ Lb(0)} of the underlying 
polyhedron | AT | . Nevertheless, in general, this topological space is not suitable 
to represent the continuous interpretation of O provided by /. For instance, a 
digital object O = {a} consisting of a single n-cell tr G A' is usually perceived 
as a point; while, for any lighting function, |Lb(0)| = tr is an n-dimensional 
ball. Anyway, a topological space, called continuous analogue, more suitable to 
represent such a continuous interpretation, can be derived from the light body 
of an object in our approach. 

These continuous analogues define the fifth level of the multilevel architecture 
associated with each digital space. For simplicity, we introduce here only the two 



Eladio Dominguez and Angel R. Frances 



extreme levels of this architecture for a given digital object O in a digital space 
(AT,/); see [2] for a complete description of these architecture. For this, we use 
an arbitrary but fixed centroid-map c : iF — > \ K\ on the device model K. 

The device level of a digital object O in (K, f) is the polyedral subcomplex 
K{0) = {a € K\ a < a € O} induced by O in K. Notice that the device level 
of an object is fixed for any lighting function on K; that is, the discrete object 
displayed in a digital image does not depend on the continuous interpretation we 
make of it. On the other hand, the continuous level of O may be different for each 
lighting function f on K. Namely, it is defined as the underlying polyhedron | Ao | 
of the simplicial complex Ao, whose fc-simplexes are (c(ao), c(ai), . . . , c(ak)} 
where ao < < ■ ■ ■ < at are cells in K such that f{0, at) = 1 (or, equivalently, 

S Lb(0)), for 0 < i < k. The simplicial complex Ao is called the simplicial 
analogue of O, and the polyhedron | Ao | is said to be the continuous analogue 
of O. 

Since continuous analogues are representing the continuous interpretation 
that an observer makes of digital objects, our architecture allows us to introduce 
digital notions in terms of the corresponding continuous ones. For example, we 
will say that an object O is connected if its continuous analogue | Ao \ is a con- 
nected polyhedron; and, O is also naturally said to be a digital manifold if | Ao \ 
is a combinatorial manifold. Doing that, it arises the problem of characterizing 
these notions in digital terms (see [2] for connectivity, [1],[2] for digital surfaces 
and [4] for the notion of digital fundamental group of a digital object). 

3 Kovalevsky’s Digital Spaces 

In this section we show the relationship between the cell complex approach of 
Kovalevsky [10] and our approach. 

Kovalevsky uses a n-dimensional abstract cell complex K to represent the 
spatial layout of xels in a digital image, where the n-cells in K are the xels. 
Recall that any abstract cell complex has a well defined topological structure. In 
particular, for a given complex K, the following combinatorial notion of connec- 
tivity is equivalent to the usual definition in general topology: a subset A C K 
is said to be connected if for each pair of cells a, f3 £ A there is a finite sequence 
(ao, ai, . . . , ak) in A such that ao = a, ak = P and either ai is a face of a^-i 
or ai-i is a face of a^, for 1 < * < fc. 

In order to avoid connectivity paradoxes ([7]), Kovalevsky points out that a 
digital image should not be defined by specifying only the set of xels belonging 
to the object O displayed in the image. Instead, a subset A{0) C K contain- 
ing (possibly) some lower dimensional cells of K must be associated with that 
object (naturally the only n-cells in A{0) correspond to xels in O). Under Ko- 
valevsky’s point of view any topological property (in particular, connectivity) 
of the object O is analyzed through the properties of the set A{0). But, in our 
opinion, some type of uniform methods to associate every digital object O with 
such a set A{0) is needed if our goal is to perform a formal development of the 
topological analysis of digital images. In fact, the idea of these uniform methods 
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Fig. 2. Kovalevsky’s spaces may not represent accurately our usual perception 
of objects 



is underlying the notion of “face membership rule” introduced by Kovalevsky 
in [10]. From this point of view, the theoretical development of Digital Topology 
take as basic structure the pair consisting of an abstract cell complex and a face 
membership rule. 

From the remarks above, Kovalevsky’s approach can be interpreted in our 
approach as follows. 

Definition 5. A Kovalevsky’s space is a pair (AT, Lb) where K is a polyhedral 
complex and Lb : 'P{celln{K)) V{K) is a map satisfying 

(!’) Lb{0) n celln{K) = O, for any digital object O (see Def. 1 and Remark 2). 

Notice, however, that this notion of digital space is too much general. For 
example, let us consider a digital object O C cell 2 (i?^) consisting of two pixels 
(2-cells) tr, T such that a = % (see Fig. 2). It is possible to find quite a lot 
Kovalevsky’s digital spaces (i?^,Lb) in such a way the set Lb(0) contains a 
sequence (oq, oi, . . . , at) with oq = cr, ak = t and Oi < Oi-i or < a^, 

1 < i < fc. Since Lb(0) is connected, O should be considered as a connected 
digital object in this space; but this is in contradiction with our usual perception 
of two isolated pixels. 

In this way, it seems advisable to restrict the category of Kovalevsky’s digital 
spaces to another category in which it is possible to find properties of general 
application and, moreover, whose digital spaces are not contradictory with our 
usual perception of digital images. 

4 Analyzing the Axioms of Lighting Fnnctions 

In this section we will consider light body maps; or, equivalently, lighting func- 
tions which does not necessarily satisfy all axioms given in Def. 1. Notice, how- 
ever, that for a given digital object O, we can use the set Lb(0) to replicate the 
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construction of a continuous analogue \Ao \ for O. But doing that, the continu- 
ous analogue of a digital object may be contradictory with our usual perception 
of that object. Actually, each axiom in Def. 1 prevents some of such anomalous 
analogues as we will next justify. 

4.1 Object Axiom 

To justify this axiom, we shall use a quite natural argument related to the 
connectivity of a digital object. 

As Kovalevsky, the notion of connectivity we will consider for digital objects 
is naturally induced by the face relationship among the cells of the device model. 

Definition 6. Let O he a digital object in a device model K and Lb a light body 
map defined on K. Two n-cells (xels) <j,t G O are said to be connected by a 
path in O if there exists a sequence (oq, Oi, . . . , ak) in Lb{0) such that oq = cr, 
ak = T, and either eXi-\ < exi or exi < oci-\, for 1 < i < k. A subset C O 
is a digital component of O if any pair of n-cells a,r G C are connected by 
a path in O and, in addition, no n-cell a € C is connected by a path in O to 
a n-cell in O — C . Finally, a digital object is said to be connected if it has only 
one component. 

Notice that the cell a\, in the previous definition, is necessarily a face of 
the n-cell a. So that, for each n-cell a G O, the map Lb determines what faces 
of a are allowed to connect a with any other n-cell in O. 

However, this natural notion of connectivity for a digital object O may not 
agree exactly with the connectivity of its continuous analogue |Ao|; that is, 
with our continuous interpretation of the connectivity of O. On one hand, if 
(T ^ Lb(0) for a xel a G O, the previous definition yields that C = {cr} is a 
digital component of O; but this component is not represented in the continuous 
analogue since c(cr) ^ | Ao \ ■ This problem disappears if the light body map Lb 
satisfies the following property: 

(T) for all O C cell„(A), O C Lb(0). 

On the other hand, if a cell a G Lb(0) — O is not connected by a path to 
some xel in O, then there is a component in | Ao | which does not correspond 
to any digital component of O. A sufficient condition to prevent this fact is the 
following: 

(A) for any cell a G Lb(0) there is a n-cell a GO such that a < cr. 

In particular, none of the n-cells in the background cell„(A) — O oi O should 
belong to Lb(0), since these cells are not faces of n-cells in O; otherwise, a path 
connecting two xels in O could intersect the background of O. 

For a light body map satisfying these two properties it is immediate to show 
that there is a one-to-one correspondence between the digital components of 
any digital object O and the connected components of its continuous analogue 
\Ao I; that is, the connectivity of digital objects, as it is defined in Def. 6, agrees 
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Fig. 3. In case a lighting function does not satisfy the support axiom, the con- 
tinuous interpretation provided by the continuous analogue | Ao \ of an object O 
may be contrary to our usual perception of that object 



with the connectivity of their continuous analogues. Moreover, properties (I’) 
and (A) together imply the object axiom: for any digital object O C cell„(AT), 
Lb(0) n cell„(AT) = O. 



4.2 Support Axiom 

Notice that, the object axiom and property (A) above are limiting the set of cells 
that may be lighted for a given digital object. In fact, these two conditions suffice 
to ensure that the two pixels in a digital object O = {u, r}, with tr H r = 0, are 
never connected by a path of lower dimensional cells “crossing” the background 
of O, as it could be the case in Kovalevsky’s digital spaces (see Fig. 2). 

However, the object axiom and property (A) do not suffice to avoid the 
following unusual continuous interpretation of a digital object. 

Let O = {cr, r} be a digital object in the device model with dimer n r = 1, 
and let Lb be a light body map defined on such that Lb(0) = {cr, r, /3i, /? 2 }, 
where f3i,P2 are the two vertices of the edge tr n r (see Fig. 3(a)). Notice that 
Lb can be defined for the rest of objects in in such a way it satisfies both 
the object axiom and property (A) above. However, the corresponding continu- 
ous analogue \Ao\ depicted in Fig. 3(b) is a continuous closed curve; but this 
continuous interpretation of O is contrary to our usual perception of that object. 

This anomalous example vanishes if, in addition to the n-cells cr and r, the 
light body Lb(0) is allowed to contain only the cell cr n r; that is, the only cell 
in supp(O). So, we require in general the support axiom: Lb(0) C supp(O) for 
any object O C cell„(AT). 

4.3 Monotone Axioms 

Monotone axioms, as well as local axioms analyzed in next section, are needed 
to ensure a certain compatibility between the continuous analogues of different 
digital objects. To show this, let us consider the digital object 0\ C cell 2 (i?^) 
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Fig. 4. A discrete digital space may contain connected digital objects consisting 
of two or more pixels in case the lighting function does not satisfy the weak 
monotone axiom 



shown in Fig. 4(a), and let Lb : 7^(cell2(i?^)) — *■ V(R^) be the light body map 
on the device model given by Lb(Oi) = supp(Oi) and Lb(0) = O for any 
other object O fy Oi. Notice that Lb satisfies the object and support axioms 
above. The continuous analogues \Aoi \ and \Ace\\ 2 {R'^) I of the objects Oi and 
cell 2 (i?^) (i.e., the object consisting of all the pixels in the device model R^), 
derived from this light body map, are those depicted in Figs. 4(b) and (d), re- 
spectively. Observe that | Aoi | is a connected straight line, while the polyhedron 
I'4ceii2(j?2) I is a set of discrete points. But this continuous interpretation is con- 
trary to our usual perception, since the only connected objects contained in a 
discrete ambient space are singletons. 

A sufficient condition to avoid this type of examples is the weak monotone 
axiom: 

For any object O C cell„(iF) in a device model K, Lb(0) C Lb(cell„(iF)). 

caption 

It is immediate to show that if a light body map Lb satisfies this axiom, 
then the continuous analogue of any object O is always a subpolyhedron of the 
continuous analogue of the space cell„(iF); that is, \Ao \ C \AceW^[K) I- But, for 
two given digital objects Oi C O 2 , we cannot derive in general that \Aox \ is a 
subpolyhedron of | A 02 \ ■ A necessary condition to show this fact is the strong 
monotone axiom: 

Lb(Oi) C Lb(02), for any pair of objects Oi C C> 2 - 

Within the graph-theoretical approach to Digital Topology, Kong and Roscoe 
([9]) define the notion of (a, /3)-surface on for (a, /3) G {(6, 26), (26, 6), (6, 18), 
(18, 6)}, generalizing the (6,26)- and (26,6)-surfaces of Morgenthaler and Rosen- 
feld [11]. In [1] we prove that the strong monotone axiom suffice to represent, 
in our framework, the continuous interpretation associated with these (a, (3)- 
surfaces, for (a, f3) G {(6, 26), (26, 6), (6, 18)}. More precisely, we show that there 
are digital spaces (R^, /«/?), where fa 0 is a strongly local and strongly mono- 
tone lighting function, such that the set of digital surfaces in these spaces co- 
incides with the set of (a, /3)-surfaces. Recall that a digital surface in a digital 
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space (AT, /) is a digital object S whose continuous analogue | As \ is a surface. 
However, it cannot be found a strongly monotone lighting function /is,6 on 
such that every (18, 6)-surface is a digital surface in (i?^,/i8,6)- Also in [1] we 
show that such a digital space exists in the category of strongly local but weakly 
monotone lighting functions. Up to now, we have not found a digital space whose 
digital surfaces coincide exactly with the set of (18, 6)-surfaces. We conjecture, 
however, that this space could be found in the category of lighting functions 
given in Def. 1. 

4.4 Local Axioms 

The argument to justify these axioms is related to the physical limitations of 
the human perception. Indeed, our usual perception of objects is always local. 
So that, the continuous interpretation we can make of some piece of a digital 
object should be independent of any other piece of that object which is not near 
enough to the first one. As a consequence, it seems reasonable to ask lighting 
functions to satisfy some type of local property. Notice that, in case the lighting 
function satisfies the object axiom, such a property is not needed for the set 
of n-cells in the device model since, in that case, any n-cell a is lighted for an 
object O if and only if a belongs to O. If, in addition, the lighting function also 
satisfies the support axiom, then that local condition is only necessary for lower 
dimensional cells in the support of an object O, since f{0,a) = 0 for any cell 
a ^ supp(O). 

The natural method to formalize such a local condition is to relate the lighting 
of a lower dimensional cell a G K for a given digital object O to a certain digital 
neighbourhood of a in O. There are two natural possibilities for this notion of 
digital neighbourhood: 

the star of a in O : st„(o;; O) = {a G O; a < cr}; and 

the extended star of a in O : st* (a; O) = {a G O; a f] a ^ 0}. 

Notice that the star of a in O coincides with the set st(a; AT) n O, where 
st(a; K) = {P G K; there is 7 G AT with a < 7 and /3 < 7} is related to the 
smallest open neighbourhood of a in the topology of AT when it is reagarded as 
an abstract cell complex; see [10]. However, the polyhedron |st(a; K) \ is not a 
neighbourhood of a in the Euclidean space | AT | . On the other hand, st* (a; O) = 
N{a; K) n O, where A(a; AT) = {/3 € AT; /3 < 7 G AT, 7 0 a yf 0} is the smallest 
subcomplex of AT which is also a neighbourhood of a in | AT | , see [13] . Depending 
on which one of these topologies we want to represent, we should use one of the 
two following axioms for a map / : 7^(cell„(Ar)) x V{K) {0, 1}: 

Strong local axiom : /(O, a) = /(st„(a; O), a) 

Weak local axiom : f{0,a) = /(st* (a; O), a) 

As it was quoted in the previous section, the category of strongly local and 
weakly monotone digital spaces suffices to represent the continuous interpreta- 
tion of digital objects associated with the notion of (a, /3)-surface. However, we 
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Fig. 5. The complement’s connectivity axiom is required to obtain a right rep- 
resentation of the connectivity of the background of any object 



show in [2] that, for any digital space {R^, /) in this category, the set of digital 
surfaces in {R^, /) does not contain the set of all strong 26-surfaces ([5]). On the 
other hand, such a digital space can be found in the category of weakly local 
digital spaces ([2]). 

4.5 Complement’s Connectivity Axiom 

As it is usual, we consider that the object displayed in a binary digital picture 
consists of the pixels with value 1, while the background of that picture is the set 
of the pixels with value 0. It is well known that the perception of a set of pixels, 
and so its topological properties, may be different if we consider that set as the 
object displayed in a picture or as the background in the complementary picture. 
This leads us to think that, for a given digital object O in a device model K, the 
properties of the background cell„(A) — O of O should not be analyzed by using 
the continuous analogue | v4ceii„(i<')-o since this polyhedron is representing the 
continuous interpretation of the set of pixels cell„(A") — O when it is considered as 
the digital object displayed in a digital picture. On the contrary, the continuous 
interpretation of the background of the object O is better represented by the 
complement |Aceii„(/f)| — \^o \ of its continuous analogue. In particular, the 
connectivity of this topological space should describe the connectivity of the set 
of pixels cell„(AT) — O when it is understood as the background of O. 

In [I] we introduce a “digital” notion for the connectivity of the background 
of an object O at the device level of a digital space {K,f). And, moreover, we 
show that in case the map / is strongly local, weakly monotone and if, in addi- 
tion, it satisfies both the object and support axioms, then there is a one-to-one 
correspondence between the digital components of the background cell„(AT) — O 
and the connected components of the topological space | Aceii„(if) | — | Ao \ ■ How- 
ever, next example shows that, in general, this correspondence does not hold for 
a map / satisfying only the weak local axiom. 

Let us consider the map / : 7^(cell2(i?^)) x V{R?) {0,1} defined on the 

device model R^ by f{0,a) = 1 if and only if st 2 (a;i?^) C O. Notice that / 
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satisfies all axioms in Def. 1 but the complement’s connectivity axiom. Now, let 
O = — {a, t} be the digital object consisting of all the 2-cells in but two 

of them a, r as it is shown in Fig. 5(a). It is easily checked that, for the map /, 
the continuous analogue of the object cell 2 (i?^) is the whole Euclidean plane, 
while \Ao I is the polyhedron depicted in Fig. 5(b). From these facts, we get that 
I^ceii 2 (fl 2 ) I ~ l“^0 I is a connected space, which does not correspond with our 
usual perception of the background {a, r} of O consisting of two isolated pixels. 

This paradoxical example can never occur for a lighting function /; that is, 
a function satifying the object, support, weak monotone and weak local axioms 
and, in addition, the complement’s connectivity axiom in Def. 1: 

For objects O' C O C cell„(iF) and a cell a £ K such that st„(a;0) = 
st„(a; O'), f{0', a) = 0 and /(O, a) = 1, then: (a) the set of cells a{0'- O) = 
{/3 < a;f{0',(3) = 0, f{0,(3) = 1} is not empty; (b) the set U{,5;/3 G 
a{0'\ O)} is connected in da- and, (c) if O C O C cell„(AT), then /(O, /3) = 1 
for every f3 G a(0'; O). 

Actually, if O is an object in a digital space {K, f) such that / satisfies all the 
axioms in Def. 1, then there is a one-to-one correspondence between the digital 
components of cell„(A") — O and the connected components of | -4ceii„(iC) | — | Ao |, 
as in the case of the strongly local lighting functions (see [2]). Finally, notice 
that the strong local axiom implies the complement’s connectivity axiom, since 
f{0,a) = /(st„(a; O), a) = 1 and f{0',a) = /(st„(a; O'), a) = 0 yield that 
st„(a; O) yf st„(a; O'), and so no cell a can satisfy all the requirements in the 
complement’s connectivity axiom. 

5 Conclusions 

Along the previous section we have given natural reasons justifying the neces- 
sity of several axioms to limit a general notion of digital space to a category 
in which certain paradoxical interpretations of digital objects cannot be done. 
We have pointed out various relations among these axioms. And we have also 
recalled some of the results reached using them, which are showing that our 
approach to digital topology is expressive enough to represent the continuous 
interpretation associated to several notions of digital surface introduced within 
the graph-theoretical approach to digital topology. Actually, these digital sur- 
faces are particular cases, for suitable digital spaces, of a general notion of digital 
manifold that naturally arises in our approach. 

As the starting point for the quoted justification, we have observed that 
Kovalevsky’s approach to digital topology can be understood in our framework 
as the category of the digital spaces satisfying only the object axiom. 

Finally, it is worth to point out that the topological spaces on the grid 
proposed by Khalimsky [6] as a model for digital pictures are isomorphic to 
the conceptual level of suitable digital spaces defined on the device model i?” 
(see [2]). 
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Abstract: Cell eomplexes are potentially very useful in many fields, in- 
cluding image segmentation, numerieal analysis, and computer graph- 
ics. However, in practice they are not used as widely as they eould be. 
This is partly due to the difficulties in actually implementing algorithms 
on top of eell complexes. We propose to use generic programming to 
design eell complex data structures that are easy to use, efficient, and 
flexible. The implementation of the new design is demonstrated for a 
number of common cell complex types and an example algorithm. 



1 Introduction 

Cell complexes and related structures (such as combinatorial maps) are fundamental 
concepts of digital topology. They are potentially very useful in many application 
areas. For example, they can be used in image analysis to represent a segmented 
image, in eomputational geometry and computer graphics to model 2- and 3-dimen- 
sional objects, and in numerical analysis to build finite element meshes. Yet, in 
praetice they are not used as widely as one would expect. I believe that this is partly 
due to the difficulties their aetual implementation and applieation poses to the pro- 
grammer. In this paper I will present a solution to this problem that was originally 
developed in the eontext of image analysis. It is interesting to note that independent 
investigations in other fields (e.g. computational geometry and numerical analysis) 
arrived at very similar solutions [2,5]. Although these fields have quite different 
goals, the generic concepts to be described here seem to be equally helpful. 

In the field of image analysis, image segmentation is the most important applica- 
tion for cell complexes. On the basis of segmented images, questions like “What are 
the eonnected regions?”, “Where do regions meet?”, “Where are the edges?” etc. will 
be answered. It is well known that naive representations of segmentation results, such 
as binary images or edge images, have problems in answering such questions consis- 
tently and without topological contradictions. 

The most infamous problem is the so called connectivity paradox which occurs 
when one defines connected regions by determining connected components in a 
binary image. No matter whether 8- or 4-neighborhood is used while constructing the 
connected components, there are configurations that violate Jordan’s curve theorem. 
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That is, in this representation it is not always assured that a closed curve partitions the 
plane into exactly two distinct connected components (see e.g. [8]). 

Kovalevsky [8] has shown that problems like this can be overcome by using topo- 
logical cell complexes (formal definition below). In particular, planar cell complexes 
(i.e., 2-complexes that can be embedded in the plane) are appropriate for image 
analysis. On a theoretical level, researchers generally agree on the usefulness of cell 
complex representations, but in practice they are quite rarely used. Instead, most 
segmentation methods deal with topological problems heuristically, if at all. I’ll 
demonstrate the difficulties in practical application of cell complexes by means of two 
typical examples: the cell complexes defined in the TargetJr image analysis frame- 
work [13] and the combinatorial maps defined by Dufourd and Puitg [3]. These 
examples show that current software either requires a lot of manual work to build and 
manipulate a cell complex or are very slow in execution. 

The first example are the topological classes of TargetJr [13]. TargetJr is a very 
popular basic framework for image analysis research and, to a degree, represents the 
state-of-the-art in object-oriented image analysis software. It is written in C-t-l- and 
thus allows for very fast algorithms and a moderate degree of flexibility. In the area of 
topology, it provides basic building blocks for 2-dimensional cell complexes, most 
notably: Vertex, Edge, Face, OneChain. The former three represent the different 
cell types, while the latter encodes a sequence of connected edges, which e.g. repre- 
sent the boundary of a face. In principle, it is possible to construct any planar cell 
complex by using these primitives. But in practice this is very hard, because the user 
is responsible for ensuring consistency of the resulting complex - there is no class 
Cell Comp lex that would help the programmer in constructing a topological struc- 
ture. In the words of the TargetJr documentation: 

“For operations that must reuse the vertices [...] the programmer is responsible for 
maintaining their consistency with the topological structure as a whole.” [13], 
http://www.targetjr.org/manuals/Topology/nodel8.html 

Handling is further complicated by the lack of separation between the combinatorial 
and geometrical structures of the complex - all topology classes also carry geometric 
information (e.g. coordinates in case of vertices). Furthermore, when the cell complex 
has finally been constructed, navigation on it is also quite hard. For example, there is 
no obvious way of getting the ordered cycle of edges around each vertex. Essentially, 
TargetJr thus leaves the most difficult tasks to the user. 

A totally different approach to the construction of cell complexes is described by 
Dufourd and Puitg [3], my second example. They use the notions of dart and orbit to 
define quasi-maps, which contain planar maps (planar cell complexes) as special 
cases (formal definitions below). This theory is formally specified and implemented 
in a functional programming languages, which leads to a very elegant and easy to use 
solution. For example, the orbits can be retrieved by simple functions, so that naviga- 
tion on the map is very easy. Since all required constraints are automatically enforced 
during map construction, inconsistent structures cannot be build. This takes a lot of 
responsibility from the user. 

However, this approach also has some disadvantages. First, it is very difficult to 
integrate it with other image analysis software: most image analysis software isn’t 
written in a functional language, and it is difficult to connect software components 
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Fig. 1. Example for a planar cell 
complex. Some cells are marked with 
their types. The degree (= number of 
1 -cells bounded) is given for some 
0-cells 



written in different paradigms and languages. Second, according to the definitions in 
[3] even the simplest operations (getting the successor of a dart in an orbit) take linear 
time in the size of the cell complex. This leads to unacceptably slow algorithms when 
cell complexes of realistic size (thousands of faces) need to be processed. It is not 
clear to me whether a more efficient implementation (with constant complexity in the 
basic operations) is possible in a functional environment. 

This paper proposes to use generic programming in order to overcome the prob- 
lems mentioned. It describes possibilities to define operations on cell complexes in a 
way that is simultaneously efficient, flexible and easy to use. These definitions will 
take on the form of abstract interfaces. It will also be shown how these interfaces can 
be implemented on top of different topological data structures. 



2 Cell Complexes and Related Topological Structures 

Discrete topological structures can be defined in multiple ways, giving rise to differ- 
ent data structures. However, from the viewpoint of algorithms that are to operate on 
top of those data structures, the differences are not very significant: many topological 
data structures provide essentially very similar functionality to algorithms. This is the 
basic justification for our attempt to define a uniform abstract interface.' 

Let us briefly review some common definitions of digital topology (detailed treat- 
ment is beyond the scope of this paper). The best known is the cellular complex [8]: 

Definition 1: A cell complex is a triple (Z, dim, B) where Z is a set of cells, dim is 
a function that associates a non-negative integer dimension to each cell, and 
B (Z Zx Z is the bounding relation that describes which cells bound other cells. 
A cell may bound only cells of larger dimension, and the bounding relation 
must be transitive. If the largest dimension is k, we speak of a k-complex. 

Two cells are called incident if one bounds the other. The bounding relation is used to 
define open sets and thus induces a topology on the cell complex: a set is called open 
if, whenever a cell z belongs to the set, all cells bounded by z do also belong to the 
set. In the context of image segmentation we are particularly interested in cell com- 
plexes with maximum dimension 2 which can be embedded in the plane. We will call 



The correspondences between the considered models will be formally studied elsewhere. 
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these planar cell complexes. Figure 1 shows a planar cell complex. We will use the 
terms node, edge, and face synonymously to 0-, 1-, and 2-cells respectively. 

Another popular definition is that of a combinatorial map [3]. This definition will 
prove especially useful for our interface definition and implementation: 

Definition 2: A combinatorial map is a triple {D, a, d) where Z) is a set of darts 
(also known as half-edges), <T is a permutation of the darts, and a is an involu- 
tion of the darts. 

In this context, a permutation is a mapping that associates to each dart a unique 
predecessor and a unique successor. By following successor chains, we can identify 
the cycles or orbits of the permutation. An involution is a permutation where each 
orbit contains exactly two elements. It can be shown that the mapping cr * a is also 
a permutation. The correspondence between combinatorial maps and cell complexes 
is established as follows: we associate each orbit in <7 with a 0-cell (node), each orbit 
in a with a 1-cell (edge; an edge is thus a pair of half-edges), and each orbit in ^ with 
a 2-cell (face) of a cell complex. A thus defined cell bounds another one (of higher 
dimension), if their corresponding orbits have a dart in common. 

In the context of rectangular rasters, Khalimsky’s topology is very useful [6]: 

Definition 3: The Khalimsky topology of the 1 -dimensional discrete line is ob- 
tained by interpreting every other point as an open or closed set respectively. 
The topology of a 2-dimensional rectangular raster is constructed as the product 
topology of a horizontal and vertical topological line. 

To interpret the topological line in the terminology of cell eomplexes we associate 
dimension 1 to the open points and dimension 0 to the closed points. The dimensions 
in the topological raster are obtained as the sum of the vertical and horizontal dimen- 
sions. Each cell bounds the cells in its 8-neighborhood that have lower dimension. 
Thus, a 0-cell bounds all its 8-neighbors, and a 1-cell bounds two 2-cells. Figure 2 
(left) shows a raster defined like this. 

Finally, when it comes to iterative segmentation and irregular pyramids, the block 
complex is a very useful notion. The idea is to build a new cell eomplex on top of an 
existing one by merging cells into larger block cells, or blocks for short. Blocks are 
defined by the property that they either contain a single 0-cell, or are homeomorphic 
to an open k-sphere. (A block is homeomorphic to an open k-sphere if it is isomorphic 
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Fig. 2. left: Khalimsky topology on a square raster (0-cells: black balls, 1 -cells: 
black lines, 2-cells: gray squares); right: block complex defined on top of a 
Khalimsky raster (0-blocks: large black balls, 1 -blocks: black lines and small 
balls, 2-blocks: gray squares, rectangles and balls) 
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to a simply connected open set in some k-complex - see [9] for details.) A block’s 
dimension equals the highest dimension of any of its cells. 

Definition 4: A block complex is a complete partition of a given cell complex into 
blocks. The bounding relation of the blocks is induced by the bounding relation 
of the contained cells: a block bounds another one iff it contains a cell that 
bounds a cell in the other block. 

Note that the induced bounding relation must meet the requirements of definition 1, 

i.e. blocks may only bound blocks of smaller dimension, and the relation must be 
transitive. This poses some restrictions on valid partitions. In particular, all 1 -blocks 
must be sequences of adjacent 0- and 1-cells, and all junctions between 1-blocks must 
be 0-blocks. See figure 2 right for an example. 

3 Generic Programming 

Generic programming is a new programming methodology developed by A. Stepanov 
and D. Musser [12]. It is based on the observation that in traditional programming 
much time is wasted by continuously adapting algorithms to ever changing data 
structures. This becomes a significant problem in application areas where algorithms 
play a crucial role. Here, algorithms should be considered as independent building 
blocks rather than being dependent on specific data structures. Therefore, generic 
programming puts its main emphasis on separating algorithms from data structures 
by means of abstract interfaces between them.^ Since algorithms indeed play a central 
role in image analysis and computational topology, generic programming suggests 
itself as a suitable software design method in these fields. 

The algorithm centered approach of generic programming consists of 5 steps: 

1 . Analyze the requirements of algorithms in the intended field. Ensure that each 
algorithm imposes minimal requirements on the underlying data structures. 

2. Organize the set of requirements into concepts. A concept contains a minimal 
set of functionality that is typically needed together. 

3. Define abstract interfaces for all concepts. Map the definitions onto appropriate 
generic constructs of the target programming language (e.g. templates in C++). 

4. Implement the interfaces on top of suitable data structures. A data structure 
may implement several concepts, and each concept may be realized by several 
data structures. 

5. Implement algorithms in terms of abstract concepts (interfaces). Thus, algo- 
rithms can run on top of any data structure that implements the concepts. 

The currently best example for this approach is the design of the Standard Template 
Library [1] which forms a major part of the C++ standard library. This library deals 
with fundamental data structures (list, arrays, trees etc.) and algorithms (sorting, 
searching etc.). On the basis of the steps sketched above, Stepanov and Musser 
arrived at the iterator concept as their fundamental interface abstraction. Because of 
its importance for our own discussion, we will describe iterators in some detail. 



^ This extents the traditional theory of abstract data t5qies and object-oriented programming, 
where abstract interfaces have only been defined for data stmctures and objects. 
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Fig. 3. Effect of a call ++iterator on different data structures, top left: array - the iterator 
is moved to the next memory location; bottom left: linked list - the iterator follows the link 
to the next item; right: tree - the iterator realizes a pre-order traversal (black arrows denote 
links in the data structure, gray ones show the path of the iterator) 

3.1 Iterators 

Algorithm analysis revealed that many algorithms need to sequentially access the 
individual data items in a container. Although there exists a wealth of different 
sequences (such as arrays, deques, and lists), most algorithms use them in very simple 
ways. For example, they access one data item, process it, and then proceed to the next 
or previous item. It was Stepanov and Musser’s key insight that the commonalties can 
be reinforced and the differences be encapsulated by means of iterators. In the exam- 
ple, a bi-directional iterator would be required which has the following capabilities^ 

1 . It points to a specific item in a specific container, and allows reading and possi- 
bly writing the data of that item. In C++, this is realized by the dereferencing 
operator: item = * iterator 

2. It can be moved to the next and previous items. In C++, this is done using the 
increment and decrement operators: ++iterator and - -iterator 

3. Two iterators can be compared to find out whether they point to the same item. 
This is realized by the quality operator: iterator == end_iterator. 

(This is used to determine the range of iteration: each data structure provides a 
pair of iterators that point to the start and one-past-the-end of the sequence re- 
spectively. The former iterator is incremented until it compares equal to the 
latter one, which signals the end of the sequence.) 

Key to the efficiency of algorithms using these iterators is the additional requirement 
the all four operations must execute in constant time. Iterators meeting these require- 
ments can be implemented for many different data structures, as shown in fig. 3. In 
case of an array, the iterator points to a memory location, and the increment operation 
moves it to the next memory location. In case of a linked list, a pointer to the next 
item is stored together with each item, the increment operator follows this link. For a 
tree, one can implement different iterators that realize pre-, post-, and in-order tra- 
versals respectively, by following the edges of the tree in the appropriate order. 

Algorithms are then defined so that they use only iterators, not the underlying data 
structures. By implementing algorithms as C++ templates, it is possible to leave the 



^ In addition, Stepanov and Musser define input, output, forward, and random access iterators. 
These will not be needed in this paper. 
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iterators’ types open until the algorithm is actually applied to a concrete data struc- 
ture. Only then will the compiler replace the formal (place-holder) type of the iterator 
with an actual type. For example, the signature of the copy ( ) -algorithm which copies 
the items from the source sequence to the target sequence, looks like this: 

template <class Sourcelterator , class TargetIterator> 
void copy (Sourcelterator src_begin, Sourcelterator src_end, 
Targetiterator target_begin) ; 

For further information about the design of the STL and generic programming in 
general, [1] is recommended. 

In case of cyclic data structures, the concept of an iterator (which has a first and 
last element) is replaced by the concept of a circulator [5]. A circulator behaves like 
an iterator, with the exception that there is no special last element where further 
incrementing is forbidden. Moreover, a circulator needs an additional function 
iterator . isSingular ( ) which returns true iff the cyclic sequence is empty. The 
circulator concept will be important in the context of cell complexes as their orbits 
form cyclic sequences. 



4 Generic Concepts for Cell Complexes 

The development of generic concepts for cell complexes must start with an analysis of 
algorithm requirements. This has been done by several researchers, e.g. [2, 5, 7]. To 
illustrate this process I will present a simple segmentation algorithm. A more exten- 
sive discussion of generic topological algorithms is beyond the scope of this paper 
and will be given elsewhere. 

The algorithm “Connected Components Segmentation” groups 2-cells together 
according to a predicate are_similar(2-cell, 2-cell) that is applied to all adjacent 2-cells 
(2-cells are adjacent if they are bounded by a common 1-cell). The exact nature of the 
predicate is application dependent and thus left unspecified here. A unique label is 
assigned to the 2-cells in each connected region of “similar” cells. After labeling all 
2-cells according to similarity, labels are assigned to 0- and 1 -cells according to the 
following rule: if a 0- or 1-cell bounds only 2-cells with the same label, it is assigned 
this same label, otherwise it is assigned a special BORDER_LABEL. It should be noted 
that this algorithm essentially creates block complexes according to definition 4, with 
the exception that 2-blocks need not be simply connected but may have holes. 

In Table 1 we give the pseudo-code of the algorithm on the left and corresponding 
requirements on the right. Although this simple algorithm does not exploit the entire 
functionality of a cell complex, it illustrates nicely how sets of abstract requirements 
are formulated: the algorithm can be applied to any data structure that supports the 
operations listed on the right. In order to maximize the scope of the algorithm these 
operations should represent minimal requirements. For example, the algorithm doesn’t 
care in which order the 1 -cells incident to a 0-cell are listed. So it would be wrong to 
require those cells to be listed in a particular ordered (say, in orbit order), because this 
would needlessly restrict the applicability of the algorithm. 
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Table 1. Connected components segmentation algorithm 



Pseudo-code 

// find connected 2-cells of single compo- 
nent 

define recurse_component(2-cell c): 
for each adjacent 2-cell ac of c do: 
if label(ac) != NO_LABEL: 

continue // ac already processed 
eise if are_similar(c, ac): 
label(ac) = label(c) 
recurse_component(ac) 

// find all connected components and 
// label cells accordingly 
define cc_segmentation(cell_complex): 
for each cell c in cell_complex do: 
label(c) = NO_LABEL 
maxiabel = NO_LABEL + 1 
//phase 1: 2-cell labeling 
for each 2-cell c2 in cell_complex do: 
if Iabel(c2) != NO_LABEL: 

continue // c2 already processed 
eise: 

Iabel(c2) = maxiabel 
maxiabel = maxiabel + 1 
recurse_component(c2) 

//phase 2: 1 -cell labeling 
for each 1-cell cl in cell complex do: 
labell = label(flrstBounded2Cell(c1 )) 
Iabel2= label(secondBounded2Cell(c1 )) 
if leftLabel == rlghtLabel: 
label(cl) = leftLabel 
else: 

label(cl) = BORDER_LABEL 
//phase 3: 0-cell labeling 
for each 0-cell cO in cell_complex do: 
for each incident 1-cell Ic of cO do: 
if label(lc) == BORDER_LABEL: 
label(cO) = BORDER_LABEL 
break 
else: 

label(cO) = label(ic) 



Data structure requirements 



// list adjacent 2-cells of given 2-cell 
// read integer attribute of 2-cell 

// evaluate predicate 

// assign integer attribute to 2-cell 



// find all cells in a cell complex 
// associate initial attribute with cells 



// list all 2-cells of a cell complex 
// read integer attribute of cell 



// assign integer attribute to cell 



// list all 1 -cells of a cell complex 
// access the two 2-cells bounded ... 

// ... by the current 1-cell 

// assign integer attribute to cell 

// assign integer attribute to cell 

// list all 0-cells of a cell complex 
// list 1 -cells bounded by current 
0-cell 

// assign integer attribute to cell 



// assign integer attribute to cell 



Of course, it is easy to keep requirements abstract in pseudo-code. But traditional 
programming techniques, especially procedural programming, do not allow to carry 
over the abstraction into executable code: they force the programmer to fix a particu- 
lar data structure so that the algorithm’s potential flexibility cannot be realized in 
practice. This is not the case in generic programming and its realization by C++ 
templates - algorithms are independent of concrete data structures so that require- 
ments can be translated into executable code without losing abstraction. 
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Analysis of topological algorithms revealed the following recurring requirements: 

Combinatorial Requirements: 

Analysis of the combinatorial structure 

• list all k-cells of a cell complex 

• list all incident cells for a given cell (in any order or in the cyclic order de- 
fined by definition 2) 

Modification of the combinatorial structure 

• elementary transformations between cell complexes 

Data Association Requirements: 

Assignment of data to cells and group of cells 

• temporary data (e.g. label of the connected component each cell belongs to) 

• geometric data (coordinates, model parameters) 

• application data (requirements vary extremely from application to applica- 
tion, e.g. regions statistics, object interpretations, textures etc.) 

It should be noted that this analysis suggests a strict separation between topological 
and geometrical properties. For a given topological structure, there are many different 
ways to represent its geometry - for example by using polygons, implicit equations, 
or sets of pixels/voxels. Separation of concerns therefore requires to treat topology 
and geometry as independent dimensions of abstraction. We will not explicitly deal 
with geometry in this paper, but concentrate on topology. However, it should be 
pointed out that representation of geometry is a chief application of data association 
with cells as described in the next subsection. 



4.1 Data Association with the Cells 

There are three basic possibilities to associate data with the cells of a cell complex: 

Store Data in the Cell Data Structure: In addition to the information that en- 
codes the combinatorial structure of the cell complex, application data can be 
stored directly in the cells. This data is retrieved by member functions of the 
cell objects. Templates are a great help in designing a cell complex that way. 

Store Data outside the Cells: This technique requires to store a unique ID with 
each cell. This ID is used to retrieve additional application data from external 
data structures such as hash tables. This technique is more flexible than the first 
because the cell data structures need not be modified to add a new attribute, but 
it may be slower and requires more care from the user. 

Calculate Data on Demand: Some data are redundant and need not be stored at 
all. If needed, these data can be calculated from other data that was stored. For 
example, the length of an edge can often be computed as the distance between 
the start and end node of the edge. 

Now we want to hide these possibilities behind an interface, so that algorithms need 
not know how the data are stored. This is important, because otherwise all algorithms 




26 Ullrich Kothe 



would have to be adapted when data must be stored differently. A suitable technique 
to achieve this is the so called data accessor [10]. 

A data accessor is a helper object that retrieves an attribute associated with a data 
item, given a handle to the item. It is similar to a member function in that it hides how 
the attribute is stored, but it is more flexible because it is independent of a particular 
data structure. As an example, let’s consider data accessors for the length of an edge 
according to each of the three possibilities above: 

struct LengthStoredInCellAccessor { 
float get (EdgeHandle edge) { 

// call member function to retrieve length attribute 
return edge . length ( ) ; 

} 

}; 

Struct LengthStoredExternallyAccessor { 

Hashtable edge_length; 
float get (EdgeHandle edge) { 

// use edge ID to retrieve length from hash table 
return edge_length [edge . ID { ) ] ; 

} 

}; 

Struct LengthCalculatedAccessor { 
float get (EdgeHandle edge) { 

// find differenc vector between start and end node 
Vector diff = edge . startNode (). coordinate ( ) - 
edge . endNode ( ) . coordinate ( ) ; 

// calculate length as magnitude of difference vector 
return dif f . magnitude () ; 




The important point about these accessors is that they are used uniformly by algo- 
rithms: algorithms uniformly call lengthAccessor . get (edgeHandle) , without 
knowing which accessor is currently used. Iterators and circulators will play the role 
of cell handles, as they always refer to a “current” element. See [10] for more details. 



5 Generic Concepts for Analyzing the Combinatorial Structnre 

Concepts for analyzing the combinatorial structure are the most interesting part of the 
cell complex interface. We will define this interface on the basis of the combinatorial 
map’s darts and orbits. This doesn’t imply that the interface is not applicable to other 
topological structures, it just simplifies the definitions. 

Iterators and circulators will form the central parts of the interface. First, we need 
iterators that simply list all cells of a given dimension: 

k-Cell Iterator: A k-cell iterator (k=0, 1, 2) iterates over all k-cells in a cell com- 
plex, in an arbitrary but fixed order. 

Second, we need iterators for the different orbits that determine the combinatorial 
structure. Since orbits are cyclic, we actually need circulators here: 

Orbit Circulator: An orbit circulator lists the darts in one of the orbits (o, a, (p). It 
is bi-directional, i.e. it can step to the successor and predecessor of a given dart. 
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Fig. 4. Incrementing a (p-circulator which goes around the interior face of the graph, 
left: initial position; center: intermediate position after a-iteration of embedded cir- 
culator; right: final position after c-iteration of embedded circulator 



For all iterators and circulators, we will use the usual ++i and --i notation to step 
forward and backward. The only exception is the a-circulator: since each a-orbit 
contains only two darts, we use the call i . jumpToOpposite ( ) to switch between 
the two elements of an a-orbit, i.e. between the two ends of an edge. Circulators can 
thus provide all three operations simultaneously, so that a- and (p-circulators auto- 
matically play the role of a-circulators as well. This will simplify many algorithms. 

Now we are already able to give a completely generic implementation of the 
(p-circulator which encodes the orbit of darts around a face: since the (p-permutation 
can be composed from the other two, we only need a o-circulator (which always is an 
a-circulator as well) to build it. Thus, we store a o-circulator within the (p-circulator, 
and all (p-operations are implemented in terms of the appropriate O and a-operations:'^ 

template <class SigmaCirculator> 

class PhiCirculator { 

SigmaCircultor sigma; 
public : 

// pass the embedded a-circulator 
PhiCirculator (SigmaCirculator contained) 

: sigma (contained) 

{} 

// implement the (p- increment as a’^a of the embedded circulator 
void operator++{) { 

contained. jumpToOpposite {) ; // one a iteration 
--contained; // one backward a iteration 

} 

// implement the (p-decrement as a of the embedded circulator 
void operator++{) { 

++contained; // one a iteration 

contained . jumpToOpposite {) ; // one a'^ iteration 

} 

... // more functions 

}; 

This code works regardless of what type the contained circulator has and what kind of 
cell complex we are using. Figure 4 illustrates a single (p-increment. 



* We use C++ for the implementation examples on two reasons. First, at present there is no 
language-independent abstract notation for generic components. Second, it demonstrates that 
the concepts presented can be and actually have been implemented in practice. 
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Fig. 5. Half-edges and their linking into 
orbits 



5.1 Implementation of the a-Circulator for a Half-Edge Data Structure 

Now, we want to explain the implementation of the o-circulator for different kinds of 
cell complexes. We start with a half-edge data structure, because this is the most 
direct implementation of a combinatorial map. The central concept of the half-edge 
data structure is the half-edge or dart, which explicitly stores pointers to its successors 
and predecessors in both the a- and a-orbits. A typical implementation of a half-edge 
looks like this (see also figure 5): 

struct HalfEdge { 

HalfEdge * sigmanext, * sigma_previous , * alphaother ; 

... // more data not relevant here 

}; 

Since the half-edge stores all necessary navigation information, the a-circulator can 
simply store a pointer to the current half-edge. Upon a call to the increment operator it 
will look up the appropriate neighboring half-edge and replace the stored half-edge 
with this one. An initial half-edge is passed to the circulator in the constructor: 

class Half EdgeSigmaCirculator { 

HalfEdge * current; 
public : 

Half EdgeSigmaCirculator (Half Edge * initial) 

: current { initial ) 

{} 

void operator++{) { 

// goto successor in a orbit 
current = current- >sigma_next ; 

}; 

void operator-- {) { 

// goto predecessor in a orbit 
current = current ->sigma_previous; 

}; 

void jumpToOpposite { ) { 

// goto successor {= predecessor) in a orbit 
current = current- >alpha_other; 

}; 

... // more functions 

}; 

This circulator gives us access to the complete combinatorial structure of the map: any 
dart, any orbit, and thus any cell can be reached by an appropriate operation sequence. 
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Fig. 6. left: four possible directions of the KhalimskySigmaCirculator 
in the o-orbit; right: two possible directions of the circulator in the a-orbit 




5.2 Implementation of a-Circulator for Khalimsky’s Topology 

Since Khalimsky’s topology is defined on a rectangular grid, we can use the grid 
coordinates to uniquely identify each cell. Note that coordinates are only used in the 
implementation and not exposed through the circulator interface. We assume that 
points whose coordinates are both odd are considered 0-cells. Then, points with two 
even coordinates are 2-cells, and the remaining points 1 -cells. A dart is an oriented 
1-cell. Hence, we add the attribute “direction” to distinguish the two darts of a 1-cell. 
Since there are four possible directions in a rectangular grid, we will denote directions 
by the integers 0...3 to encode “right”, “up”, “left”, and “down” respectively. 

Thus, a dart is uniquely determined by its direction and the coordinate of its start 
node. The o-orbit around that node is obtained by incrementing the direction modulo 
4. Similarly, the opposite dart in the a-orbit is found by moving the dart’s coordinate 
two pixels in its direction (namely to the end node of the corresponding 1-cell) and 
reversing the direction. Figure 6 shows the possible positions of the circulator in the 
o- and a-orbits. This leads to the following code for the o-circulator: 

class KhalimskySigmaCirculator { 
int X, y, direction; 
public : 

// init with a coordinate and a direction 
KhalimskySigmaCirculator { int ix, int iy, int idir) 

: x(ix), y(iy), direction (idir) 

{} 

void operator++() { 

// next direction counter-clockwise 
direction = (direction +1) % 4; 

} 

void operator-- 0 { 

// next direction clockwise 
direction = (direction +3) % 4; 

} 

void jumpToOpposite ( ) { 



Static int xStep[] = { 


2,0,- 


-2, 


0}; 




static int yStep[] = { 


0, -2, 


0, 


2}; 




X += xStep [direction] ; 






// 


go to the other end of 


y += yStep [direction] ; 






// 


the current 1-cell 


direction = (direction 


+ 2) % 


4 ; 


// 


reverse direction 



} 

... // more functions 

}; 
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Fig. 7. left: a block complex (cf. figure 2 right); right: possible positions of the block 
complex o-circulator around the black 0-cell - only two 1 -cells (black) belong to a 1 -block, 
the other two (gray) are ignored because they belong to a 2-block 




5.3 Implementation of the a-Circulator for a Block Complex 

Finally, we want to demonstrate how the o-circulator can be implemented for a block 
complex. A block complex differs from the two cell complex implementations de- 
scribed so far by the fact that blocks are made up of several cells of an underlying cell 
complex. Therefore, we can use the circulators of the underlying cell complex to 
implement the block complex’s circulators. Since the circulators have identical 
interfaces for all cell complex types, the block complex circulators can be written 
completely generically, independent of the type of the underlying cell complex. 

In order to analyze the structure of a block complex, we need some means to tell 
which kind of block each cell belongs to. Therefore, we require a data accessor that 
returns the dimension of the block for each 0- and 1-cell (2-cells always belong to 
2-blocks). For each underlying cell complex, this accessor is the only externally 
visible class that must be implemented in order to create a block complex: 

struct BlockDimensionAccessor { 

int dimensionOfEdge (SigmaCirculator) { 

... // suitable implementation for underlying cell complex 

} 

int dimensionOfNode (SigmaCirculator) { 

... // suitable implementation for underlying cell complex 

} 

}; 

The second function will return the dimension for the start node of the corresponding 
dart. Then the o-orbit of a given 0-block is identical to the o-orbit of the underlying 

0- cell, except that all 1 -cells belonging to a 2-block are ignored (see figure 7). 

The navigation in an a-orbit is slightly more difficult: given one end of a 1 -block, 
we must find the 0-cell at the other end of this 1 -block. This is essentially a contour 
following algorithm: check if the end node of the current dart is a 0-block. If yes, we 
have found the other end of the 1 -block. Otherwise jump to the next dart in the 

1 - block and repeat (see figure 8). Therefore, the complexity of an a- increment in a 
block complex is proportional to the length of the path that constitutes the current 
1 -block. The following circulator implementation realizes this algorithm (in 
jumpToOpposite ( ) ), as well as the selection of darts for the o-orbit (in 
operator++ ( ) and operator- - ( ) ): 
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Fig. 8. Contour following 
when jumpToOpposite {) 
is called on a block com- 
plex circulator (top left: 
initial position, left to right: 
intermediate positions, 
bottom right: final position) 



template <class SigmaCirculator , class BlockDimensions> 
class BlockComplexSigmaCirculator { 

SigmaCirculator embeddedSigma ; 

BlockDimensions blockDimension; 
public : 

// embed a circulator for the underlying cell cimplex and a 
// block dimension accessor 

BlockComplexSigmaCirculator (SigmaCirculator embedded, 

BlockDimensions dims) 

: embeddedSigma (embedded) , blockDimension (dims ) 

{} 

void operator++() { 
do { 

// increment embedded circulator . . . 

++embeddedSigma ; 

// ... but ignore 1-cells that are not in 1-blocks 
} while (blockDimension. dimensionOfEdge (embeddedSigma) 1= 1) ; 

} 

void jumpToOpposite ( ) { 

// contour following; 

// goto the other end of the current 1-cell 
embeddedSigma . jumpToOpposite ( ) ; 

// until we find the other end of the 1-block . . . 
while (blockDimension. dimensionOfNode (embeddedSigma) != 0) { 

// ... follow the cell chain in the 1-block 
operator++ ( ) ; 

embeddedSigma . jumpToOpposite ( ) ; 

} 

} 

... // more functions 





32 Ullrich Kothe 



5.4 Application: Generic Implementation of Connected Components 
Segmentation 

We can now translate the pseudo-code for connected components segmentation given 
in table 1 into actual C++ code. The correspondences between the two versions of the 
algorithm should be easy to see. In order to keep the algorithm independent of the 
data structures, the C++ function is declared as a template of a ceiicompiex data 
structure and a store for additional ceiiData (which holds the cells’ labels here). 

The abstract requirements on the data structure are translated into generic function 
calls as follows: The functions fi = c.beginFaces 0 and cc.endFaces () specify the 
range of an iterator that lists all 2-cells in the cell complex cc. *fi returns a unique 
handle that identifies the current 2-cell. Analogous functionality is provided for 1- and 
0-cells, pi = f i .phicircuiator ( ) returns a circulator for the (p-orbit associated with 
the given 2-cell. pi. face ogives again a handle to that 2-cell. pi. opposite () re- 
trieves the other dart in pi’s a-orbit and creates a (p-circulator associated with that 
dart. Consequently, pi. opposite () .faceO is a 2-cell adjacent to the one given by 
pi.faceO. ei . f irstBoundedFace ( ) and ei . secondBoundedFace ( ) give the tWO 
2-cells bounded by the current 1-cell (where ei is an 1-cell iterator and the assign- 
ment of “first” and “second” is arbitrary). Finally, si = ni.sigmacircuiator () 
creates a circulator for the o-orbit of the current 0-cell referenced by node iterator ni, 
and si . edge ( ) returns the handle to the edge the current dart of O-circulator si is part 
of It shall again be emphasized that these function calls work uniformly across all 
kinds of topological data structures defined in this paper. 

template <class CellComplex, class CellData> 
void 

connectedComponentsSegmentation (CellComplex const & cc,CellData & cd) 

{ 

// initialise the cells' labels to NO_LABEL 
// 2-cells 

CellComplex Facelterator fi = cc . beginFaces { ) ; 
f or ( ; fi != cc . endFaces ( ) ; ++fi) cd. label (*fi) = NO_LABEL; 

// 1-cells 

CellComplex :: Edgelterator ei = cc . beginEdges { ) ; 
f or { ; ei != cc . endEdges { ) ; ++ei) cd. label (*ei) = NO_LABEL; 

// 0-cells 

CellComplex :: Nodelterator ni = cc . beginNodes { ) ; 
f or { ; ni != cc . endNodes { ) ; ++ni) cd . label { *ni ) = NO_LABEL; 

// counter for the connected components 
int current_label = NO_LABEL + 1; 

// find all connected components of 2-cells 
for{fi = cc . beginFaces {) ; fi != cc . endFaces {) ; ++fi) 

{ 

if (cd. label (*fi) != NO_LABEL) 

// 2-cells has already been processed in a prior call to 
// recurseConnectedComponent { ) => ignore 
continue ; 

cd. label (*fi) = current_label++ ; // start a new component 
// get the (p-circulator for the given 2-cell 
CellComplex :: Phicircuiator pi = f i . phiCirculator { ) ; 

// recurse to find 2-cells belonging into current component 
recurseConnectedComponent (pi , cd) ; 
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// label the 1-cells according to the labels of the 2-cells they 
bound 

for{ei = cc . beginEdges { ) ; ei != cc . endEdges { ) ; ++ei) 

{ 

int labell = cd. label (ei . f irstBoundedFace {)) ; 
int label2 = cd. label (ei . secondBoundedFace {)) ; 
if (labell == label2) 

// if the two 2-cells belong to the same component, the 1-cell 
// also belongs to that component 
cd . label ( *ei ) = labell; 
else 

// otherwise, the 1-cell is a border cell 
cd. label (*ei) = BORDER_LABEL ; 

} 

// label the 0-cells according to the labels of the 1-cells they 
bound 

for{ni = cc . beginNodes { ) ; ni != cc . endNodes { ) ; ++ni) 

{ 

// get circulator for a-orbit of current 0-cell 
CellComplex : : SigmaCirculator si = ni . sigmaCirculator { ) ; 

int label = cd . label { si . edge {)) ; // remember label of first 

1-cell 
do { 

if (cd . label ( si . edge () ) == BORDER_LABEL) 

{ 

// if any of the 1-cells bound by the current 0-cell is 
// a border cell the 0-cell is also a border cell 
label = BORDER_LABEL ; break; 

} 

} while (++si != ni.sigmaCirculatorO); 

cd . label ( *ni ) = label; // assign label to 0-cell 

} 

} 



// recursively mark all 2-cells of a component with the same label 
template <class PhiCirculator , class CellComplex, class CellData> 
void recurseConnectedComponent (PhiCirculator pi, CellData & cd) 

{ 

int current_label = cd . label (pi . face ()) ; // remember current label 
do { 

// get (p-circulator for adjacent 2-cell 
PhiCirculator opposite_pi = pi.oppositeO; 

if (cd. label (opposite_pi . face ()) == NO_LABEL Se.Se. 
are_similar (pi . face ( ) , opposite_pi . face ( ) ) ) 

{ 

// if the adjacent 2-cell is similar and not yet assigned, 

// label it and recurse (are_similar ( ) is not discussed here) 
cd . label (opposite_pi . face 0 ) = current_label ; 
recurseConnectedComponent (opposite_pi , cd) 

} 

} 

while (++pi != f i . PhiCirculator 0) ; 

} 

The above algorithm is very efficient: since all iterator/circulator functions execute in 
constant time (except in a block complex), the overall algorithm has complexity 
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Fig. 9. left: merging of two 2-cells by removing a common 1-cell; right: merging of two 
1 -cells by removing the common 0-cell 



0{Na+ E + F(p), where N, E, and F denote the number of 0-, 1-, and 2-cells, and <r 
and (p are the average lengths of the O- and (p-orbits respectively. This is much more 
efficient than the approach of Dufourd and Puitg [3], where the functions to access the 
next dart in an orbit take linear time in the size of the cell complex (i.e. 0{N + E + 
F)), which makes the overall algorithm roughly 0{{N + E + Ff). 

5.5 Elementary Transformations between Cell Complexes 

The question of a uniform interface for cell complex transformations (insertion and 
removal of cells) is much more difficult to answer than that for the operations that 
merely navigate on an existing cell complex. This has several reasons: 

1. Different applications have quite different requirements. For example, in nu- 
meric analysis (FEM) and computer graphics, mesh refinement is a key opera- 
tion. It is often done by replacing a cell with a predefined refinement template, 
so that regularity is preserved. In contrast, in image analysis we are more inter- 
ested in merging neighboring cells in order to build irregular image pyramids. 

2. Different cell complex implementations support different sets of operations. 
For example, a half-edge data structure can essentially support any conceivable 
transformation. In contrast, a Khalimsky topology cannot be transformed at all. 
Block complexes are between the extremes: we can always merge blocks, but 
we can split only blocks that consist of more than one cell. 

Therefore, I will only briefly show which elementary transform have emerged as 
particularly useful in the field of image analysis. The term “elementary” here refers to 
transformations that only involve few cells, so that only a very localized area of the 
cell complex is changed and the operation can be implemented very efficiently. More 
complex transformations can be build on top of the elementary ones. 

The elementary transformations are often called Euler operators [11] because, on a 
planar cell complex, they must respect Euler’s equation v-e+ f = 2 (with v, e, and / 
denoting the number of nodes, edges, and faces respectively). In principle, two Euler 
operators and their inverses are sufficient to span the space of planar cell complexes 
(if we don’t allow “holes” in faces - otherwise we will need three): 

Merge Two 2-Cells: If two 2-cells are bound by a common 1-cell, we may remove 
that 1-cell and merge the 2-cells. 

Merge Two 1-Cells: If a 0-cell bounds exactly two distinct 1 -cells, we may re- 
move the 0-cell and merge the 1 -cells. 
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Fig. 10. merging of two 2-blocks in a block complex: when the 1 -block marked gray is 
removed, the cells belonging to the surviving 2-block have to relabeled (in dark gray) 



Figure 9 shows examples for either operation. By applying them in a suitable order, 
one can reduce any cell complex to a single node located in the infinite (exterior) 
face. The reverse operations are used to refine a cell complex. Complex operations are 
composed from the elementary ones. Sometimes this leads to very inefficient imple- 
mentations of complex operations. Then one may add further specialized operations, 
but for many applications this is not necessary. 

It is important to note that a direct implementation of the elementary operations is 
only possible if we can actually remove cells from the cell complex data structure. In 
case of a block complex, this is not the case. But we can still implement the Euler 
operators: instead of removing cells, we assign new block dimensions and labels to 
the cells according to the modified block structure, as figure 10 illustrates. 

It should be noted that the raw Euler operators do not update the non-topological 
data associated to the cells involved (e.g. geometrical or statistical information). If 
such data are present, it is necessary to augment the Euler operators with additional 
code that handles the update. For example, when two 2-cells are merged, the average 
gray level of the combined 2-cell must be calculated from the statistics of the original 
cells. This can be done by wrapping the raw Euler operators into new operators (with 
the same interface) that carry out additional operations before and/or after calling the 
original Euler operator. This technique is also known as the decorator pattern [4]. 
Due to the flexibility of generic programming, templated algorithms need not be 
modified in order to support this: since the augmented operators still conform to the 
uniform interface definition, they can transparently be called instead of the raw 
operators, without the algorithms knowing the difference. 



6 Conclusions 

This article has shown that generic concepts can make the use of planar cell com- 
plexes in everyday programming much more convenient: interface concepts such as 
iterators, circulators, and data accessors shield algorithms from the particulars of the 
underlying data structure. The separation between algorithms and data structures is 
very important, because most of the know-how of image analysis and other fields 
using cell complexes lies in algorithms rather than data structures. This means that it 
is usually quite difficult and error prone to change existing algorithm implementations 
whenever the underlying data structures change. It is much easier to write some 
mediating iterators and accessors and leave the algorithms untouched. The resulting 
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solutions combine the speed of C++ with the simplicity of the funetional approach of 
Dufourd and Puitg [3]. 

Connected components segmentation is a good case in point. But the advantages of 
generic programming become even more apparent when we apply a segmentation 
algorithm repeatedly and in turn with Euler operations in order to generate an irregu- 
lar segmentation pyramid: Initially, we interpret the entire image as a Khalimsky 
plane, where the pixels are considered 2-cells and the appropriate 0- and 1 -cells are 
inserted between them. Cells are labeled by means of a segmentation algorithm to 
create the first level of the segmentation pyramid (usually, this is an oversegmenta- 
tion). The components (blocks) at this level are then merged into single cells by a 
suitable sequence of Euler operators. This process is repeated to get ever coarser 
pyramid levels, until the final segmentation is satisfying. In order to optimize algo- 
rithm performance, it is a good idea to use a block complex on top of the Khalimsky 
plane for the first few levels, and switch to a half-edge data structure when regions 
become larger. By using the interfaces described here, we can implement segmenta- 
tion and merging algorithms independently of the data structures used. Choosing the 
appropriate kind of cell complex at every pyramid level thus becomes very easy. 

Of course, there is still much to be done. In particular, more cell complex algo- 
rithms should be implemented according to the generic paradigm to realize the ad- 
vantages claimed here and gain new insights for further improvement. The interaction 
between topology and geometry should be investigated more deeply. A wider choice 
of specialized Euler operators and extension to 3D would also be desirable. Never- 
theless, the approach is already working and shows promising results, with good 
performance (a few seconds) in segmenting cell complexes with thousands of cells. 
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Abstract. The paper presents an introduction to computer topology 
with applications to image processing and computer graphics. Basic 
topological notions such as connectivity, frontier, manifolds, surfaces, 
combinatorial homeomorphism etc. are recalled and adapted for locally 
finite topological spaces. The paper describes data structures for 
explicitly representing classical topological spaces in computers and 
presents some algorithms for computing topological features of sets. 
Among them are: boundary tracing («=2,3), filling of interiors 
(«=2,3,4), labeling of components, computing of skeletons and others. 



1 Introduction: Topology and Computers 

Topology plays an important role in computer graphics and image analysis. 
Connectedness, boundaries and inclusion of regions are topological features which are 
important for both rendering images and analyzing their contents. Computing these 
features is one of the tasks of the computer topology. We use this term rather than 
"computational topology" since our approach is analogous to that of digital geometry 
rather than to that of computational geometry: we are using models of topological 
spaces explicitly representing each element of a finite topological space as an element 
of the computer memory, defined by its integer coordinates. The other possible 
approach would be to think about the Euclidean space, to define objects by equations 
and inequalities in real coordinates and to approximate real coordinates on a computer 
by floating point variables. 

Computer topology may be of interest both for computer scientists who attempt to 
apply topological knowledge for analyzing digitized images, and for mathematicians 
who may use computers to solve complicated topological problems. Thus, for 
example, essential progress in investigating three-dimensional manifolds has been 
reached by means of computers (see e.g. [14]). 

Topological ideas are becoming increasingly important in modem theoretical 
physics where attempts to develop a unique theory of gravitation and quantum 
mechanics have led to the Topological Quantum Field Theory (see e.g. [2,13]), in 
which topology of multi-dimensional spaces plays a cmcial role. This is one more 
possible application field for computer topology. Thus, computer topology is 
important both for applications in computer imagery and in basic research in 
mathematics and physics. 

G. Bertrand et al. (Eds.): Digital and Image Geometry, LNCS 2243, pp. 38-58, 2001. 
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We describe here among others the new data structure called the 3D cell list which 
allows to economically encode a segmented 3D image or a 3 -manifold and gives the 
possibility to access topological information without searching. Then we describe 
algorithms on boundary tracing in 2D and 3D, filling of interiors of boundaries in kD 
(successfully tested up to «=4), component labeling, computing skeletons and 
computing basic topological relations in kD, k< 4. All algorithms are based on the 
topology of abstract cell complexes and are simpler and more economical with 
respect to time and memory space than traditional algorithms based on grid point 
models. 



2 Basic Notions 

In this section we recall some notions and definitions from the classical topology, 
which are necessary for reading the subsequent sections. 

2.1 Topology of Abstract Complexes 

There exist topological spaces in which any space element possesses the smallest 
neighborhood [1]. If the smallest neighborhood is finite then the space is called 
locally finite. Among locally finite spaces abstract cell complexes are those especially 
well suited for computer applications, as explained below. Abstract complexes are 
known since 1908 [16]. They are called "abstract" because their elements, the abstract 
cells, need not to be considered as subsets of the Euclidean space. A historical review 
may be found in [7, 17]. 

Definition AC: An abstract cell complex (AC complex) C={E, B, dim) is a set E of 
abstract elements (cells) provided with an antisymmetric, irreflexive, and transitive 
binary relation B (zExE called the bounding relation, and with a dimension function 
dim: E —^1 from E into the set 1 of non-negative integers such that dim(e') < dim(e") 
for all pairs (e',e")eB. 

The maximum dimension of the cells of an AC complex is called its dimension. 
We shall mainly consider complexes of dimensions 2 and 3. Their cells with 
dimension 0 (0-cells) are called points, cells of dimension 1 (1 -cells) are called cracks 
(edges), cells of dimension 2 (2-cells) are called pixels (faces) and that of dimension 3 
are the voxels. 

If (e', e ")e B then it is usual to write e'<e" or to say that the cell e' bounds the cell 
e". Two cells e' and e" of an AC complex C are called incident with each other in C 
iff either e'=e", or e' bounds e", or e" bounds e'. In AC complexes no cell is a subset 
of another cell, as it is the case in simplicial and Euclidean complexes. Exactly this 
property of AC complexes make it possible to define a topology on the set of abstract 
cells independently from any Hausdorff space. 

The topology of AC complexes with applications to computer imagery has been 
described in [9]. We recall now a few most important definitions. In what follows we 
say "complex" for "AC complex". 
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Definition SC: A subcomplex S = (E\ B', dim') of a given complex C = (E, B, dim) is 
a complex whose set E' is a subset of E and the relation 5' is an intersection of B with 
E'xE'. The dimension dim' k equal to dim for all cells of E'. 

Since a subcomplex is uniquely defined by the subset E it is possible to apply 
Boolean operations as union, intersection and complement to complexes. We will 
often say "subset" while meaning "subcomplex". 

The connectivity in complexes is the transitive hull of the incidence relation. It can 
be shown that the connectivity thus defined corresponds to classical connectivity. 
Definition OP: A subset OS of cells of a subcomplex 5 of a complex C is called open 
in S if it contains all cells of S bounded by cells of OS. An n-cell c" of an n- 
dimensional complex C" is an open subset of C" since c" bounds no cells of C". 
Definition SON: The smallest subset of a set S which contains a given cell c of 5 and 
is open in S is called the smallest (open) neighborhood of c relative to S and is 
denoted by SON(c, S). 

The word "open" in "smallest open neighborhood" may be dropped since the 
smallest neighborhood is always open, however, we prefer to retain the notation 
"SON" since it has been used in many publications by the author. 

Definition CL: The smallest subset of a set S which contains a given cell c of 5 and is 
closed in S is called the closure of c relative to S and is denoted by Cl(c, S). 

Definition FR: The frontier Fr(5, C) of a subcomplex 5 of a complex C relative to C 
is the subcomplex of C containing all cells c of C whose SON(c, C) contains both 
cells of S as well as cells of the complement C-S. 

Illustrations to AC complexes, SONs and closures of cells of different dimensions 
maybe found in [9, 10, 12]. 

Definition OF : The open frontier Of(5, C) of a subcomplex 5 of a complex C relative 
to C is the subcomplex of C containing all cells c of C whose closure Cl(c, C) 
contains both cells of S as well as cells of the complement C-S. 

Definition BD: The boundary dS of an n-dimensional subcomplex 5 of a complex C 
is the union of the closures of all («-l)-cells of C each of which bounds exactly one 
«-cell of S. 

Definition TL: A connected one-dimensional complex whose each cell, except two 
of them, is incident with exactly two other cells, is called a topological line. 

It is easily seen that it is possible to assign integer numbers to the cells of a 
topological line in such a way that a cell incident with the cell having the number k 
has the number k-\ or A:-l-l. These numbers are called the topological coordinates of 
the cells [10]. 

Definition CR: A Cartesian (direct) product C" of n topological lines is called an n- 
dimensional Cartesian complex [8]. 

The set of cells of C" is the Cartesian product of n sets of cells of the topological 
lines which are the coordinate axes of the «-dimensional space C". They will be 
denoted by /=1,2,...,«. A cell of C" is an «-tuple {a\, aj,..., a„) of cells a, of the 
corresponding axes: a^eAi. The bounding relation of C" is defined as follows: the n- 
tuple (fli, a2,..., a„) is bounding another distinct n-tuple (bi, b2,..., b„) iff for all 
/= 1 , 2 ,...« the cell a, is incident with b, in Ai and dim{a() < dim(b) in At. 

The dimension of a product cell is defined as the sum of dimensions of the factor 
cells in their one-dimensional spaces. Topological coordinates of a product cell are 
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defined by the vector whose components are the coordinates of the factor cells in their 
axes. 

Fig. la shows four cells in a two-dimensional Cartesian complex: is a 0-cell 

(point), Cl and C 2 are 1 -cells (a horizontal and a vertical crack), C is a 2-cell (pixel). 




a 




b 



Fig. 1. Example of a two-dimensional Cartesian (a) and non-Cartesian (b) complexes 



If we assign even numbers to the 0-cells and odd ones to the 1 -cells of the axes 
then the dimension of a cell in a Cartesian complex is equal to the number of its odd 
coordinates. 



2.2 Combinatorial Homeomorphism 

The notion of the homeomorphism of two sets is a fundamental notion of topology: 
two sets are called homeomorphic or topologically equivalent if one of them can be 
mapped onto the other by a one-to-one continuous function while the inverse map is 
also continuous. There is another classical way to define the homeomorphism, which 
way is directly applicable to complexes and may be extended to other locally finite 
spaces. It is called the combinatorial homeomorphism and is based on the notion of 
elementary subdivisions . 

The concept of an AC complex is too general: it is e.g. possible to define an AC 
complex where a 1-cell is bounded by more than two 0-cells. To avoid such situations 
elementary subdivisions have been defined in classical topology (see e.g. a modem 
survey in [17]) on the base of the topology of the Euclidean space. We give in what 
follows an independent, purely combinatorial definition. It is a recursive definition: 
we make the necessary definitions primarily for 1 -cells and then for cells of still 
greater dimensions. 

A 1-cell which is bounded by exactly two 0-cells is called proper. 

1. An elementary subdivision of a proper 1-cell c*, which is bounded by the 0-cells 
Cl** and C 2 , replaces the complex C'=(c\< c'>C2°) by the 1 -complex C" with two 
1 -cells C\, C2* and a new 0-cell C"=(ci°< Ci'>C3°< C2*>C2°). One or both of 
the 0-cells Ci° and C 2 can be missing. 
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2. The following Definitions should be used recursively: first for m=\, then for m=2 
etc. 

A ff 2 -complex arising through N (N>0) elementary subdivisions of a single proper 
ff?-cell is called an open combinatorial m-ball. When m=\ then it is a sequence of 
pairwise incident 1- and 0-cells, starting and ending with a 1-cell. A single 1-cell is 
also an open combinatorial 1 -ball. 

The boundary of an open m-ball is called a combinatorial (m-\)-sphere. When 
w=l then it consists of exactly two 0-cells. The closure of an m-ball is called the 
closed m-ball. The union of two closed m-balls with identical boundaries is called a 
combinatorial m-sphere. 

An m-cell c", m>l is called proper if its boundary dc"' is an combinatorial (m-1)- 
sphere. 




An elementary subdivision in an «-complex replaces a proper m-cell c”, \<m<n, with 
two proper m-cells Ci™, C'T and one new proper (m-l)-cell bounding both m- 
cells Cl"" and C 2 ™ while the boundary is an (m-2)-sphere 

a(cruc'“-'>UC 2 "’)=ac“ and dc”. 

An AC complex is called proper if all its cells are proper. 

Two proper AC complexes are called combinatorially homeomorphic if they 
possess isomorphic subdivisions. Fig. 3 shows an example. 




Fig. 3. A subdivision of a digitized square, which is isomorphic to a digitized triangle; black 
circles are new points introduced during the subdivision 



2.3 Manifolds and Surfaces 

Among the variety of Hausdorff spaces there are spaces possessing certain relatively 
simple and important topological properties. They are called manifolds. It is known 
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that manifolds of dimension not greater than three may be triangulated. This means 
that there exists a simplicial complex homeomorphic to a given manifold. 

An AC complex which is combinatorially homeomorphic to the triangulation of a 
manifold may represent the topological properties of a manifold (of dimension up to 
three) in the same way as the triangulation does. This fact opens the possibility to 
model manifolds in computers for investigating them. 

Definition MA: An «-dimensional combinatorial manifold («-manifold) without 
boundary is an «-dimensional complex Af in which the boundary of the SON(/’, M) of 
each 0-cell P is homeomorphic to an (w-l)-sphere. In a manifold with boundary the 
SON(/’, M) of some 0-cell P may have a boundary homeomorphic to a "half-sphere", 
i.e. to an («-l)-ball. 

Surfaces in a 3D space are frontiers of 3D subsets of the space. Under rather 
general conditions surfaces are 2-manifolds. Conditions under which the frontier of a 
3D subset is a 2-manifold as well as surfaces which are no manifolds but rather 
"quasi-manifolds" are considered in [11]. 



3 Data Structures 

In this Section we consider some well-known and also some new data structures 
useful for representing topological information in two- and three-dimensional 
digitized images. 

3.1 The Standard Raster 

Two- and three-dimensional images are usually stored on a computer in arrays of the 
corresponding dimension. Each element of the array contains either a gray value, or a 
color, or a density. This data structure is not designed for topological calculations, 
nevertheless, it is possible to perform topological calculations without changing the 
data structure. For example, it is possible to trace and encode the boundary of a region 
in a two-dimensional image in spite of the apparent difficulty that the boundary 
consists of 0- and 1 -cells, however, the raster contains only pixels which must be 
interpreted as 2-cells. The reason is that a pixel is mostly a carrier of an optical feature 
which is proportional to certain elementary area. Thus pixels must correspond to 
elementary areas which are the 2-cells rather than 0- or 1 -cells whose area is zero. On 
the same reason voxels must correspond to 3 -cells. 

The tracing in the standard raster is possible because the concept of an AC 
complex is the way of thinking about topological properties of digitized images rather 
than a way of encoding them. Let us explain this idea for the case of tracing 
boundaries. 
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We think of a two-dimensional (2D) image as of a 2D Cartesian complex 
containing cells of dimensions form 0 to 2. The 2-cells (pixels) have integer 
coordinates which, unlike to topological coordinates of a pixel being always odd 
(compare Section 1.1 and Fig. 1), may take in the standard raster any integer values, 
both odd or even. Pixels are explicitly represented in the raster, whereas the 0- and 1- 
cells are present implicitly. 
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Fig. 4. Non-topological coordinates of cells of lower dimensions 

Coordinate Assignment Rule: Each pixel F of a 2D image gets one 0-cell assigned 
to it as its "own" cell. This is the 0-cell lying in the comer of F which is the nearest to 
the origin of the coordinates {P\ in Fig. 4). Also two 1 -cells incident with F and with 
P are declared to be own cells of F (Ci and C 2 in Fig. 4). Thus each pixel gets three 
own cells of lower dimensions. All own cells of F get the same coordinates as F. 

In the three-dimensional case each voxel gets seven own cells of lower dimensions 
which are arranged similarly. These seven cells get the same coordinates as the 
corresponding voxel. 

Unfortunately, some cells in the boundary of the raster remain without an "owner". 
In most applications this is of no importance. Otherwise the raster must be 
correspondingly enlarged. 

According to the above rule, it is not difficult to calculate the coordinates of all 
pixels incident with a given point and to get the gray values of the pixels from the 
array. Depending on these gray values the tracing point P moves to its next position. 
Details of the tracing algorithm are described in Section 3.1. 

The majority of low level topological problems in image processing may be solved 
in a similar way, i.e. without representing cells of lower dimension as elements of 
some multidimensional array. A typical exception is the problem of filling the interior 
of a region defined by its boundary. The solution is simpler when using two array 
elements per pixel: one for the pixel itself and one more for its own vertical crack (1- 
cell). The solution consists in reading the description of the boundary (e.g. its crack 
code), labeling all vertical cracks of the boundary in the array, counting the labeled 
cracks in each row (starting with 0), and filling the pixels between the crack with an 
even count 2 / and the next crack (with the count 2 /+1). The details of this algorithm 
are described in Section 3.3. 
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Even more complicated topological problems may be solved by means of the 
standard raster. For example, when tracing surfaces (Section 3.2) or producing 
skeletons (Section 3.5) simple pixels must be recognized. A pixel is simple relative to 
a given region if the intersection of its boundary with the boundary of the region is 
connected. It is easier to correctly recognize all simple pixels if cells of all dimensions 
of the region are labeled. To perform this in a standard raster, it is possible to assign a 
bit of a raster element representing the pixel F to each own cell of F. For example, 
suppose that one byte of a two-dimensional array is assigned to each pixel of the 
image shown in Fig. 4. Consider the pixel F with coordinates (1, 2) and the byte 
assigned to it. The bit 0 of the byte may be assigned to the 0-cell P\, the bit 1 to the 1- 
cell Cl, the bit 2 to the 1-cell C 2 . The remaining bits may be assigned to F itself 
Similar assignments are also possible in the 3D case. 

As we see, there is no necessity to allocate memory space for each cell of a 
complex, which would demand four times more memory space than that needed for 
pixels only, or eight times more than that needed for the voxels in the 3D case. 

The most data structures commonly used in the processing of 2D images may be 
used together with the standard raster. These are primarily the run length code and the 
crack code. The latter differs from the widely used Freeman code in that it contains 
only four directions rather than eight of the Freeman code. This is due to the 
properties of a 2D Cartesian complex whose oriented 1 -cells have exactly four 
different directions. 



3.2 The Topological Raster 

Complexes used in topological investigations by means of a computer often have a 
relatively small number of cells. The direct access to cells of all dimensions and the 
possibility to use more than two different labels for cells of lower dimensions is then 
more important than the possibility to save memory space. This is the case, e.g. when 
investigating 3-maniflds represented as boundaries of subsets in a four-dimensional 
space while the space is represented as a four-dimensional array. In such cases a 
topological raster is more suitable than a standard one. 

In a topological raster each coordinate axis is a topological line (see Definition TL 
in Section 1.1). The 0-cells of the axis have even coordinates, the 1 -cells have odd 
coordinates. The dimension and the orientation (if defined) of any cell may be 
calculated from its topological coordinates which in this case coincide with the 
indices of the corresponding array element. The dimension of any cell is the number 
of its odd coordinates, the orientation is specified by indicating which of the 
coordinates are odd. For example, the cell Ci in Fig. 1 above has one odd coordinate 
and this is its X-coordinate. Thus it is a one-dimensional cell oriented along the X- 
axis. The 2-cell F has two and the 0-cell P has no odd coordinates. In a three- 
dimensional complex the orientation of the 2-cells may be specified in a similar way: 
if the /th coordinate of a 2-cell F is the only even one then the normal to F is parallel 
to the /th coordinate axis. 

We shall show in Section 3 how topological relations between cells, like the 
bounding or incidence relations, may be calculated from their topological coordinates. 
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3.3 Data Structures Using Lists of Space Elements 

A data structure designed to efficiently represent topological information must satisfy 
the following two demands: 

1. The structure must contain complete topological information sufficient to get 
knowledge about topological relations among the parts of the image or of a 3D 
scene without a search. To the topological relations belong primarily the 
incidence and the adjacency relations (two distinct subsets are adjacent if there is 
a space element incident with both of them). 

2. The structure must be able to correctly represent non-proper complexes which are 
often used in topological investigations because they contain much less elements 
than the corresponding proper complexes. 

The notions of proper and non-proper complexes have been introduced by the author 
in [12]. We give here only an example and the necessary short explanation. The 
surface of a torus may be represented as a complex consisting of one 0-cell, two 
1 -cells and one 2-cell (Fig. 5a). This representation has the advantage of being very 
simple. 

However, if one would try to interpret this representation as an AC complex, 
difficulties would occur since e.g. the AC complexes corresponding to Fig. 5a and 
Fig. 5b are the same: the same sets of four cells, the same bounding relation and the 
same dimensions of the cells. The difference between these two complexes is that 
each of the 1 -cells L\ and L 2 in Fig. 5a bounds the 2-cell two times, on both sides. This 
may be seen, if one considers the embedding of the complex in a Euclidean space: a 
neighborhood of a point on the 1-cell contains two half-disks each of which lies in 
one and the same 2-cell. However, there is no possibility to describe this relation in 
the language of complexes. 




a 




Fig. 5. Representations of the surface of a toms (a) and of a simple complex (b) 



Since one of our aims is to consider a purely combinatorial approach with no relation 
to a Euclidean space we consider the possibility to overcome this difficulty by 
introducing the notion of an incidence structure [12] as explained below. 

Thus when considering Fig. 5a as a representation of a complex then it is not a 
proper one (see Section 1.2): though each A:-cell with k>Q is homeomorphic to an open 
A:-ball the boundaries of the cells are not homeomorphic to (A-l)-spheres. 

Data structures known from the literature do not fulfil the above demands 1 and 2. 
The classical incidence matrix (see e.g. [15]) enables one to encode any proper cell 
complex. It contains complete topological information. However, it is not suitable to 
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encode non-proper complexes, as explained above. Besides that, it is not economical: 
it contains in the case of an «-dimensional complex 

k=\ 

elements where Nt is the number A:-dimensional cells. This number is in practically 
relevant cases too large. Because of this reasons data structures using "linear" rather 
than "quadratic" lists of space elements are preferable. 

Most 2D data structures of this kind can be hardly generalized for the 3D case. So 
the structures using the notion of "half-edges", e.g. the FTG [5], or the n-G-map [3] 
would need in the 3D case the introduction of "half-faces". In this case each edge 
would occur in so many copies as the number of faces bounded by it. The structure 
would be no more a graph as this is the case for the FTG: a complete FTG structure 
would be needed for each 3D region, which is not economical. No suggestion for a 
3D version of the FTG structure is known to the author. 

In computer graphics and geometric modeling 3D list data structures are known 
since many years. One of the most popular is the "boundary representation" [4]. This 
structure enables one to easily trace the boundary of a 2D face of a body. However, to 
find which bodies in a 3D scene are adjacent to each other demands an exhaustive 
search through the descriptions of all vertices of all bodies in the scene. Even simpler 
questions, as e.g. which edges are incident with a given vertex, demand an exhaustive 
search to be answered. This is true for all 3D data structures known to the author. 

As far as we know, the possibility to represent non-proper complexes was not 
discussed in the literature before the author's publication [12]. 

3.4 The Two-Dimensional Cell List 

A 2D data structure satisfying the above mentioned demands, called the cell list, has 
been suggested by the author [9]. The peculiarity of the cell list is that the topological 
information, namely that of the incidence, is explicitly represented in it: it is possible 
to directly get the information about the boundaries of regions and the endpoints of 
lines. Information about adjacencies is available with a restricted local search since 
"adjacent" means "incident with an incident element". 

The data structure of the cell list is based on the topological notion of a block 
complex [15] which we have adapted to AC complexes [9]. 

Definition BC: Consider a partition Mof a complex A into subsets <3, . Subsets with 

k=Q are 0-cells of A; each of the subsets with k>0 is combinatorially homeomorphic to 
an open k-dimensional ball. There are a bounding relation BR and a dimension 
function Dim defined on Min the natural way. The triple 
B(A)=(M, BR, Dim) 

is called a block complex of A, the subsets o- are called k-dimensional blocks or k- 
blocks. 

Examples of two-dimensional block complexes and cell lists may be found in 
[9,10]. 
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3.5 The Three-Dimensional Cell List 



We call the subcomplex composed of all cells incident with a given proper cell c the 
incidence structure of c. In a 2D space the incidence structure of a cell consists either 
of a cyclic sequence or of two pairs of cells. The cyclic sequences are B-isomorphic 
to one-dimensional complexes (B-isomorphism [11] is a one-to-one map retaining the 
bounding relation but not the dimensions of cells). They can be represented in the 
computer as chained lists. In the 3D case the incidence structures of 0- and 3-blocks 
are B-isomorphic to two-dimensional complexes. 

The author has shown [12] that in the case when the space is a 3-manifold the 
incidence structures are B-isomorphic to two-dimensional spheres. A finite 2-sphere 
is isomorphic to the surface of a convex polyhedron and therefore may be represented 
as a list of polygonal faces. This is the theoretical base of the 3D cell list. This data 
structure is appropriate to describe topological features of 3-manifolds and of 3D 
scenes containing many bodies which may have common faces, edges or vertices. 

Let us consider a simple example of a topological 3D cell list with only two bodies 
where five faces of each cube are considered as a single face. 




Fig. 6. A simple 3D block complex (a) and its cross section perpendicular to L\ (b) 



The 3D cell list of the 3D image of Fig. 6 is shown in the following tables. 



List of branch points (0-blocks) 



Label 


Ason 


Lines 


Pi 


2 


—Li, +L 2 


?2 


2 


+Li, —L 2 



The partial list of the 0-blocks indicates for each 0-block the number Asqn of all 
1 -blocks (lines) incident with P, and their labels. The negative sign of a line's label in 
the row of P, indicates that this line goes away from P,. We have skipped here the 
geometric information, i.e. the coordinates. 

In the list of 1 -blocks (lines) Asqn denotes the number of blocks in the SON of the 
line Li, i.e. the number of blocks bounded by Li. The pointer Zt points to the chained 
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list containing the indices of these blocks as shown in the last column. The order of 
the sequence corresponds to a right-handed rotation around Z,,. A negative sign before 
a label of a face indicates that its normal is oriented against the direction of rotation. 



List of lines ( 1 -blocks) 



Label 


Start! 

ng 

point 


End 

Point 


J^SON 


Point 

er 


Chained list 


Li 


Pi 


Pi 


5 


Zi 


-Fi A Fi A-F2A F2A+F3 


Li 


Pi 


Pi 


5 


Zi 


-Fi A Fi A-F2A F2A+F3 



List of faces (2-blocks) 



Label 


+V 0 I 


-Vol 


Nci 


Point 


Chained list 


Pi 


- 


Vi 


4 


Zi 


Px-^—L2^Pi^—Lx-^Px 


Fi 


Fi 


Vi 


4 


Z 4 


Px-^-L2^Pi^-Lx^Px 


Fi 


- 


Vi 


4 


Z 5 


P 2~^+L2~^P 1 



The list of faces contains for each current face T’, the labels of two volumes bounded 
by Fi. The volume denoted by "+Vol" lies in the direction of the normal of F,. Nq\ 
denotes the number of blocks in Cl{Fi) which is shown as the chained list in the last 
column. The order of the sequence corresponds to a right-handed rotation around the 
normal of T’,. A negative sign before a label of a line indicates that the line is oriented 
against the direction of the rotation. 



List of volumes (3-blocks) 



Label 


Nci 


Faces 


Vi 


2 


+Fx, -F 2 


V 2 


2 


+Pi, 



The partial list of volumes contains for each volume F, the number Nqi of the incident 
faces which are listed in the last column. The negative sign before the label of a face 
in the row of F, indicates that the normal to the face is pointing away from F,. 



4 Algorithms 

We describe here some algorithms for computing topological features of subsets in 
2D and 3D digitized images. Since the programming languages of the C-family are 
now more popular than that of the PASCAL-family, we use here a pseudo-code which 
resembles the C-language. 
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4.1 Boundary Tracing in 2D Images 

Boundary tracing becomes extremely simple when thinking of a 2D image as of a 2D 
complex. The main idea of the algorithm consists in the following: at each boundary 
point find the next boundary crack and make a step along the crack to the next 
boundary point. Repeat this procedure until the starting point is reached again. 
Starting points of all boundary components must be found during an exhaustive 
search through the whole image. The following subroutine Trace ( ) is called each 
time when a not yet visited boundary point of a region is found. 

To avoid calling Trace ( ) more than once for one region, vertical cracks must be 
labeled (e.g. in a bit of Image!]) as "already visited". Traced follows the 
boundary of one foreground region while starting and stopping at the given point 
(x,y) . Points and cracks are present only implicitly as explained above in Section 
2.1. Trace ( ) starts always in the direction of the positive Y-axis. After each move 
along a boundary crack C the values of only two pixels R and L of SON(P) of the end 
point P of C must be tested since the values of the other two pixels of SON(P) have 
been already tested during the previous move. For a detailed description of this 
algorithm see [10]. 

The pseudo-code of Trace ( ) 

Image [NX, NY] is a 2D array (standard raster) whose elements contain gray 
values or colors. The variables P, R, L and the elements of the arrays right [4] , 
left [4] and step [4] are structures each representing a 2D vector with integer 
coordinates, e.g. P.X and P.Y. The operation "+" stands for vector addition. Text 
after // is a comment. 

void Trace (int x, int y, char image []) 

{ P.X=x; P.Y=y; direction=l; 
do 

{ R=P+right [direction] ; // the "right" pixel 

L=P+left [direction] ; // the "left" pixel 

if { image [R] ==foreground) 

direction= (direction+1 ) MOD 4; // right turn 
else 

if { image [L] ==background) 

direction= (direction+3 ) MOD 4; // left turn 
P=P+step [direction] ; //a move in the new direction 
} while ( P . X ! =x II P.Y! =y) ; 

} // end Trace 



4.2 Tracing of Surfaces in 3D 

To trace surfaces of bodies in a 3D standard raster the method by Gordon and Udupa 
[6] uses the 2D technique in 2D slices. The code of a single closed surface is 
disconnected, i.e. it consists of isolated codes of the slices; 33% of the pixels are 
visited twice, which is not economical, and a body whose parts have only common 
cracks but no common faces are considered as disconnected. 

A more efficient method producing a single connected sequence of code elements for 
each closed surface is that of [11]. According to the method the program chooses an 
arbitrary pixel of the surface S as the starting pixel and labels its closure. Then it 
traces the open frontier Of(T, S) (see Section 1.1) of the setT of labeled cells, encodes 
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the pixels of Of(i, 5) (1 byte per pixel), and labels the closures of simple pixels 
(Section 2.1). This ensures that L remains homeomorphic to a closed 2-ball. It has 
been proved that if the surface S is homeomorphic to a sphere then the traced 
sequence is a Hamilton path: each pixel is visited exactly once. Otherwise there 
remain a few non-simple pixels which are visited twice. Their code elements are 
attached to the end of the sequence of simple pixels. Thus the code sequence is 
always connected. 




A verbal description of the algorithm follows. A detailed description and the 
related proofs may be found in [1 1]. 

4.2.1 The Algorithm 

Notations: S is the surface to be traced. It must be a 2-quasi-manifold. LaS is the 
subset of labeled cells; it is homeomorphic to a closed 2-ball. The "rest sequence" is 
the set of non-simple pixels at the stage when all simple pixels of S are already 
labeled. The rest sequence is empty if the genus of S is zero. 

1. Take any pixel of S as the starting pixel label its closure and save its 
coordinates as the starting coordinates of the code. This is the seed of L. Denote 
any one crack of the boundary of Ft(Fq,S ) as Cg/d and find the pixel F of S which 
is incident with C„id and adjacent to Fq. Set F^id equal to T’o and the logical 
variable REST to FALSE. REST indicates that the tracing of the rest sequence is 
running. 

2. (Start of the main loop) Find the crack as the first unlabeled crack of Fr(F’,S) 
encountered during the scanning of Fr(F’,5) clockwise while starting with the end 
point of Cold which is in Fr(L,5). If there is no such crack and F is labeled stop the 
algorithm: the encoding of S is finished. 

3. If F’ is simple label its closure. 

4. Put the direction of the movement from Fgu to Cgu and that of the movement 
from Cold to F into the next byte of the code. If the pixel F is non-simple set the 
corresponding bit in the code (to recognize codes of non simple pixels in the 
ultimate sequence). 

5. If REST is TRUE check, whether F is equal to F^top and is equal to Cstop- 
(These variables were defined in item 6 of the previous loop ). If this is the case 
stop the algorithm and analyze the rest sequence to specify the genus of S as 
explained in [1 1]. Delete multiple occurrences of pixels from the rest sequence. 




52 Vladimir Kovalevsky 



6. If F is simple set REST equal to FALSE; else set F^top equal to F, Cstop equal to 
C„ew and REST equal to TRUE. 

7. Set Fold equal to F. Find the pixel F„ow of S incident with C„ow and adjacent to F. 
Set F equal to F„o„ and Cgu equal to C„ow- Go to item 2. 

End of the algorithm. 

4.3 Filling the Interiors of Boundaries in Multi-dimensional Images 

To test whether an «-cell P of an «-space lies in the interior of a given closed 
boundary it is necessary to count the intersections of the boundary with a ray from P 
to any point outside the space; however, it is difficult to distinguish between 
intersection and tangency (Fig. 8 a and b). The solution becomes easy if the boundary 
is given as one or many («-l)-dimensional manifolds in a Cartesian AC complex and 
the "ray" is a sequence of alternating n- and (w-l)-cells all lying in one row of the 
raster (Fig. 8c). 




Fig. 8. Intersection (a) and tangency (b) are difficult to distinguish in "thick" boundaries; this is 
easy at boundaries in complexes (c) 

In a 2D image the boundary must be a closed sequence of cracks and points (Fig. 8c). 
Then intersections are only possible at vertical cracks and the problem of 
distinguishing between intersections and points of tangency does not occur. The 
method has been successfully implemented for dimensions n=2, 3, 4. 

4.3.1 The Pseudo-Code 

Denote by F the current «-cell of the «-dimensional standard raster. Choose a 
coordinate axis A of the Cartesian space (e.g. A=X in the 2D case). Denote by C (F) 
the own («-l)-cell of F, whose normal is parallel to A (e.g. the vertical crack of F in 
the 2D case). Label all («-l)-cells of the given boundary whose normal is parallel to 
A. In the 2D case when A~X these are the vertical cracks of the given boundary. 

for each row R parallel to A do 
{ BOOLEAN fill=FALSE; 

for each n-cell F in the row R do 

{ if C(F) is labeled then f ill=l— f ill ; // inverting fill 
if fill is TRUE then F= foreground; 
else F=background; 



4.4 Component Labeling in an n-Dimensional Space 

We consider here the simplest case of a 2D binary image in a standard raster while the 
algorithm is applicable also to multi-valued and multi-dimensional images in a 
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topological raster. It is expedient to consider a multi-dimensional image as a one- 
dimensional array Image\N]. For example, in the 2D case the pixel with coordinates 
(x, y) may be accessed as Image[y-NX+x] where NX is the number of pixels in a row. 

In a standard raster a function must be given which specifies which raster elements 
are adjacent to each other and thus are connected if they have the same color. In our 
simple 2D example we use the well-known "8-adjacency" of the foreground pixels 
and the "4-adjacency" of the background pixels. In the general case the adjacency of 
the «-cells of an n-dimensional complex must be specified by rules specifying the 
membership of cells of lower dimensions [9] since an adjacency of «-cell depending 
on their "color" is not applicable for multi-valued images. 

In a topological raster the connectivity of two cells is defined by their incidence 
which in turn is defined by their topological coordinates (see below Condition 3.6.3 in 
Section 3.6). 

4.4.1 The Algorithm 

Given is a binary array Image [] of N elements and the functions 
NumberNeighb (color) and Neighb (i , k) : the first one returns the number of 
adjacent pixels depending on the color of a given pixel; the second one returns the 
index of the kth neighbor of the ith pixel. As the result of the labeling each pixel 
gets additionally (in another array Label [] ) the label of the connected component 
which it belongs to. 



Fig. 9. Illustration to the algorithm of component labeling 

4.4.2 The Pseudo-Code 

Allocate the array Label [N] of the same size as Image [N] . Each element of 
"Label" must have at least log2N bits, where N is the number of elements in Image . 
In the first loop each element of Label gets its own index as its value: 

for (i=l; i<N; i++) Label [i]=i; 
for (i=l; i < N; i++) 

{ color=Image [i] ; 

for (j=0; j <NumberNeighb (color) ; j++) 

{ k=Neighb(i, j) ; //the index of the jth neighbor of i 
if ( Image [k] ==color) SetEquivalent (i , k, Label ) ; 

} // end of the first run 

SecondRun (Label , N) ; // end of the algorithm 

The subroutine SetEquivalent ( ) makes the preparation for labeling the pixels 
having the indices i and k as belonging to one and the same component. For this 
purpose one of the pixels gets the index of the "root" of the other pixel. The function 
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Root ( ) returns the last value in the sequenee of indiees where the first index k is 
that of the given pixel, the next one is the value of Label [k] ete. until Label [k] 
becomes equal to k. The subroutine SecondRun ( ) replaees the value of 
Label [k] by the value of a eomponent counter or by the root of k depending on 
whether Label [k] is equal to k or not. 

4.4.3 Pseudo-Codes of the Subroutines 

subroutine SetEquivalent (i , k, Label ) 

{ if (Root (i , Label ) <Root (k, Label ) ) 

Label [Root (k, Label ) ] =Root (i , Label ) ; 
else Label [Root (i , Label ) ] =Root (k, Label ) ; 

} // end of SetEquivalent 

int Root (k. Label) 

{ do 

{ if (Label [k] ==k) return k; 

k=Label [k] ; 

} while (1) ; 

} // end of Root 

subroutine SecondRun (Label , N) 

{ count=l; 

for (i=0; i<N; i++) 

{ value=Label [i] ; 
if (value==i) 

{ Label[i]=count; 
count= count -hi ; 

} 

else Label [i] =Label [value] ; 

} // end of SecondRun 

4.5 Skeleton of a Set in 2D 

Definition SK: The skeleton of a given set 7 in a two-dimensional image 7 is a subset 
5 cz 7 with the properties: 

a) S has the same number of connected components as 7; 

b) The number of connected components of 7-5 is the same as that of 7-7; 

c) Certain singularities of 7 are retained in S. 

Singularities may be defined e.g. as the "end points" in a 2D image or "borders of 
layers" in a 3D image etc. 

A well-known difficulty in calculating skeletons is that it is impossible to remove all 
simple pixels simultaneously without violating the above conditions. However, 
representing an image as a complex C makes it possible to calculate the skeleton by a 
procedure which may be either sequential or parallel. It is based on the notion of the 
open frontier (s. Section 1.1 above). The procedure consist in removing simple non- 
singular cells of 7 alternatively from the frontier Fr(7, Q and from the open frontier 
0f(7, C). A cell c of the frontier Fr(7, Q (respectively of Of(7, Q) is simple if the 
intersection of SON(c)-{c} (respectively Cl(c)-{c}) with both 7 and its complement 
C-7 is connected. We present a simple version for a 2D topological raster. 
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4.5.1 The Algorithm 

Let C[NX, NY\ be a 2D array with topological coordinates. The subset T is given by 
labeling cells of all dimensions of T: C[x,y] >0 iff the cell (x,y)e T. To delete a cell 
means to set its label C[x,y] to zero. A 0- or 2-cell c is considered as singular iff it is 
incident with exactly one labeled cell other than c. 

To calculate the skeleton of T run the following loop: 

do { Scan C and delete all simple and non-singular cells of 7’nFr(7’, Q; 
CountClose = number of cells deleted during this scan; 

Scan C and delete all simple and non-singular cells of Tr\Of(T, C); 
CountOpen = number of cells deleted during this scan; 

} while (CountClose+CountOpen > 0); 

// end of Algorithm 

Fig. 10 shows an example. 





Fig. 10. a) a given 2D subcomplex T; b) its frontier Fr; c) the set T—Fr. the simple cells of the 
frontier deleted; d) the open frontier Of of the set T-Fr, e) the set T—Fr— Of. the simple cells of 
the open frontier deleted; f) the skeleton 

The result may be, if desired, easily transformed either to a sequence of pixels or to 
1 -complex containing only points and cracks. 

4.6 Algorithms for Topological Investigations 

Topological computations are particularly simple in a Cartesian complex with 
topological coordinates. We present in the following sections some basic algorithms. 

4.6.1 Computing the Dimension of a Cell in an n-Dimensional Space 

If A=(Xi, X 2 , ..., X„) is a cell of an «-dimensional Cartesian complex then 

tl 

Dimension(X) = ^ X.MOD 2. 
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4.6.2 Condition of Bounding in an «-Complex: the Cell A Bounds the Cell B 

Ai is the zth coordinate of the cell A\ dim(Ai) is the dimension of Ai in the ith 
coordinate axis (dim(y4,) is either 0 or 1). 

The condition: Vi = dim(.4;) < dim(S,) a MaxDif = 1; 

where dim( Ai ) = Ai MOD 2; and MaxDif = max fA,-- Bf, i=\...n. 

4.6.3 Condition of Incidence: The Cell A Is Incident with the Cell B 
A bounds B OR B boimds A OR A= B\ 

4.6.4 Computing the SON of a A^-Cell A in an n-Dimensional Space 

To explain the idea we first show as an example all cells of the SON of a 1-cell A 
in a 3D space. Let and A^{2, 3, 6). A has two even coordinates. It is possible to 
change one or both of them by +1 to get a cell bounded by .4. 

The number N of cells bounded by ^4: N = '2-c\ + 2^ C 2 = 2-2 -i- 4- 1=8; where 

C \ denotes the number of combinations of i elements from k. 

The cells bounded by ^4 are: (1, 3, 6) 

(3, 3, 6) 

(2, 3, 5) 

(2, 3, 7) 

(1,3,5) 

(1,3,7) 

(3, 3, 5) 

(3, 3, 7) 

Fig. 11. The SON of a 1-cell in a 3D space 

Now we present the pseudo-code of a function computing the SON of a cell "Cell" 
in a 4-dimensional space. The coordinates of all cells Ce SON(Ce//) will be saved in 
the array SON [ ] [4]. 

4.6.5 The Pseudo-Code 

void SaveSON(int Cell[4], int SON[ ][4]) 

{ int C[4], step[4]; 

//"step" contains the increments of the coordinates: 1 for even and 0 for odd ones 
for (k=0; k<4; k++) 

{ if ((Cell[k] MOD 2)=0) step[k]=l; 
else step[k]=0; 

} 

i=0; 

// four nested loops; C runs through all cells bounded by Cell: 
for (C[3]=Cell[3]-step[3]; C[3]<Cell[3]+step[3];C[3]++) 
for (C[2]=Cell[2]-step[2]; C[2]<Cell[2]+step[2];C[2]++) 
for (C[l]=Cell[l]-step[l]; C[l]<Cell[l]+step[l];C[l]++) 
for (C[0]=Cell[0]-step[0]; C[0]<Cell[0]+step[0];C[0]++) 

{ for (k=0; k<4; k++) SON[i][k]=C[k]; 
i=i+l; 

} 




} // end of SaveSON 
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A similar algorithm for computing the closure of a eell in a 4-dimensional space: 

void SaveClosure(int Cell[4], int Cl[ ][4]) 

{ int C[4], step[4]; 

//"step" contains the increments of the coordinates: 0 for even and 1 for odd ones 
for (k=0; k<4; k++) 

{ if ((Cell[k] MOD 2)==1) step[k]=l; 
else step[k]=0; 

} 

i=0; 

// four nested loops; C runs through all cells bounding Cell: 
for (C[3]=Cell[3]-step[3]; C[3]<Cell[3]+step[3];C[3]++) 
for (C[2]=Cell[2]-step[2]; C[2]<Cell[2]+step[2];C[2]++) 
for (C[l]=Cell[l]-step[l]; C[l]<Cell[l]+step[l];C[l]++) 
for (C[0]=Cell[0]-step[0]; C[0]<Cell[0]+step[0];C[0]++) 

{ for (k=0; k<4; k++) Cl[i][k]=C[k]; 
i=i+l; 

} 

} // end of SaveClosure 



5 Conclusion 

We have demonstrated that abstract cell complexes may be successfully used for 
modeling locally finite topological spaces satisfying the elassical axioms on a 
computer and for solving topologieal problems. We have suggested a new data 
structure, the 3D eell list, for encoding 3D segmented images or 3-manifolds in such a 
way that topological relations between subsets may be immediately extracted from the 
structure without searching. Another important property of the cell list is the 
possibility to consistently deseribe the so-called non-proper complexes having the 
advantage of being very simple although not representable by classical means since in 
a non-proper complex a cell may multiply bound another cell. 

We have presented descriptions and/or pseudo-codes of seven basie algorithms for 
computing topological features of subsets of an abstraet cell complex represented on a 
computer as an «-dimensional array, «<4. Some of the described algorithms are the 
neeessary tools when implementing high-level topological algorithms, namely: 

Automatic calculation the cell list of a segmented (labeled) «-dimensional image 
(«=2 or 3); 

Handle decomposition of a 3 -manifold in a 4D space, which is useful for 
comparing two manifolds with each other. 

Identification of faces of a polyhedron represented by a cell list and producing the 
cell list of a 3 -manifold with the aim of comparing two manifolds with each other. 

Modeling of linked spheres in an «-dimensional space («=3, 4, 5) with the aim to 
experimentally investigate their topological properties. 

The letter algorithms have been developed and successfully tested by the author, 
however, they cannot be described here because of paper size limitations. 

An important open problem is, whether the described approach, which yields very 
simple descriptions of 3-manifolds, may contribute to the problem of their 
classification. 
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Abstract. Our goal is to describe an economic way of presenting 
3-manifolds numerically. The idea consists in replacing 3-manifolds by 
cell complexes (their special spines) and encoding the spines by strings 
of integers. The encoding is natural, i.e., it allows one to operate with 
manifolds without decoding. We describe an application of the encoding 
to computer enumeration of 3-manifolds and give the resulting table. A 
brief introduction into the theory of quantum invariants of 3-manifolds 
is also given. The invariants were used by the enumeration for auto- 
matic casting out of duplicates. Separately, we investigate 3-dimensional 
submanifolds of R^. Any such submanifold can be presented by a 3-di- 
mensional binary picture. We give a criterion for a 3-dimensional binary 
picture to determine a 3-manifold. 



1 Cell Complexes 

In topology of manifolds the notion of a cell complex is usually considered in a 
more general sense than in computer topology (see [4], [5]). So we briefly recall it, 
restricting ourselves to the 2-dimensional case. We prefer to do that inductively. 

1 . A 0-dimensional cell complex is a finite collection of points called vertices. 

2. An 1-dimensional cell complex is obtained from a 0-dimensional com- 
plex A° by attaching several 1-dimensional cells (i.e., arcs). The endpoints 
of the arcs are attached to the vertices of A°, and the arcs must have no 
other common points. It is easy to see that a 1-dimensional cell complex is 
nothing more than a graph (loops and multiple edges are allowed). 

3. A 2-dimensional cell complex is obtained from a 1-dimensional com- 
plex by attaching several 2-dimensional cells. In other words, we take 
a collection {£>i,...,D„} of disjoint 2-dimensional discs and attach each 
disc Di to X^ via an attaching map ipp. dDi — > X^. It is convenient to assume 
that the inverse image (p~^{e) of each open 1-cell e of X^ consists of open 
connected subarcs of dDi such that each of them is mapped onto e homeo- 
morphically. In other words, we require that the boundary curve h = ipi{8Di) 
of every 2-cell passes along the edges monotonically, without returns inside 
them. X^ is called the 1-dimensional skeleton of X^. 

We point out that in general the incidence relation between cells does not 
determine X^ uniquely. It is necessary to know in which direction the boundary 
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curve of each 2-cell passes along each edge as well as a cyclic order in which li 
passes along different edges. For example, consider two cell complexes obtained 
from a rectangle by identifying its vertices to one point and by different identi- 
fications of the sides (see Fig. 1). Each of them has one vertex, two loop edges, 
and one 2-cell attached to each edge exactly twice. Nevertheless, the complexes 
are different, since one of them is a torus while the other is a 2-sphere with three 
identified points. 



1 — 




^ 1 




2 2 




V 


’ 




• > — ' 




< i 



1 



2 





Fig. 1. Two complexes with the same incidence relation 



On the other hand, information on the graph X^, on the orientations of edges, 
and on the order in which the boundary curves of the 2-cells of pass along the 
oriented edges is quite sufficient for reconstructing X^. The corresponding data 
can be written numerically as follows. First, we enumerate the vertices and the 
edges, and write down all the edges as a sequence of pairs (*i, ji), . . . , 

Here ik^jk are the numbers assigned to the vertices joined by the edge number k. 
Simultaneously, we orient each edge by an arrow directed from the vertex no. ik 
to the vertex no. jk- Second, for each 2-cell we write down a string (jpi p 2 . . . Pn) 
of non-zero integers (± edges’ numbers) which show how the boundary curve of 
the 2-cell passes along the edges. The sign of pi shows us the direction in which 
the curve passes along the edge. For example, the complexes in Fig. 1 can be 
described by the following data: 

1. Edges: (1, 1), (1, 1) (in both cases); 

2. 2-Cells: (12 — 1 — 2) in the first case and (1 — 1 2 — 2) in the second. 

Obviously, the total number of vertices, the sequence of pairs, and the col- 
lection of the strings determine X^. It is less obvious that if the link of each 
vertex is connected (i.e., if each vertex of X^ has a cone neighborhood with a 
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connected base), then is completely determined by the strings only. Indeed, 
to recover X^, it suffices to do the following: 

1. Realize the strings by boundaries of disjoint polygons with oriented and 
numbered edges; 

2. Identify the edges that have the same numbers via orientation preserving 
homeomorphisms. 

Then the 2-dimensional cell complex thus obtained is homeomorphic to X^. 
Later on we will use this observation for encoding cell complexes by strings only. 



2 Spines of 3-Manifolds 

A compact topological space M is called a 3-manifold, if every point x G M has 
a ball or half-ball neighborhood such that x corresponds to the center. The union 
of all points of M which admit no ball neighborhood is called the boundary of M 
and denoted by dM. The boundary of any 3-manifold M is a compact surface. 
If the boundary is empty, M is called closed. 

Definition. Let P be a subpolyhedron of a 3-manifold M with nonempty 
boundary. Then P is called a spine of M , if the difference M\P is homeomorphic 
to the direct product of dM and a half-open interval. By a spine of a closed 
3-manifold M we mean a spine of the punctured manifold Mq = M\V, where V 
is an open 3-ball in M. 

The direct product structure allows us to squeeze (or collapse) M continu- 
ously onto P. Usually one considers spines that cannot be collapsed onto smaller 
subpolyhedra. That happens when no triangulation of a spine has principal sim- 
plices with free faces (then we cannot start the collapse). For example, a natural 
spine of the 3-ball is a point. Nevertheless, has many other spines, the 
most famous is the Dunce hat D obtained from a triangle by identifying its edges 
as shown in Fig. 2. There is an evident collapsing of B^ onto the lateral surface 
of a cone contained in it, which determines a collapsing of B^ onto the Dunce 
hat. 




Fig. 2. Dunce hat 
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3 Special Spines 

A spine of a 3-manifold carries much information about the manifold, in particu- 
lar, all its homotopy characteristics such as the Euler characteristic and the 
fundamental group. Nevertheless, different 3-manifolds can have homeomorphic 
spines. For example, the punctured solid torus and the punctured manifold x 

are distinct, but can be collapsed onto a wedge of a 2-sphere and a circle. 
To avoid that, we restrict our attention to so-called special spines having a very 
nice local structure. 

Definition. A 2-dimensional polyhedron is called special if it can be pre- 
sented as a cell complex such that every vertex has a neighborhood E homeo- 
morphic to the open cone over the complete graph A on four vertices. 

We will call E a butterfly. It is a strange butterfly indeed. Its body consists 
of four segments having a common endpoint, and it has six wings (see Fig. 3, 
where E is shown in three different forms). A spine of a 3-manifold is called 
special, if it is a special polyhedron. 

The 1-dimensional skeleton of a special polyhedron is a regular graph of 
valence four. The edges consist of triple points, and the vertices can be viewed 
as crossing points of triple lines. One can regard special spines as generic spines 
or spines in general position. 




Fig. 3. Equivalent ways of looking at vertices 



Theorem 1 ([!]) Any 3-manifold possesses a special spine. 

This is not surprising, since in squeezing a manifold to its spine we have a 
lot of freedom, which is sufficient for avoiding non-general position singularities. 
Two examples of special spines of the 3-ball (Bing’s House with Two Rooms and 
the Abalone) are shown in Fig. 4. Notice that the Dunce hat is not special, since 
its unique vertex has no neighborhood of the required type. 

Let us demonstrate how can be collapsed onto Bing’s House. First, we 
squeeze B^ onto the cube shown in Fig. 4. Then we penetrate through the upper 
tube into the lower room and exhaust the interior of the room. At last, we do 
the same with the upper room. 
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Fig. 4. Bing’s House and the Abalone 



To collapse the 3-ball onto the Abalone, we collapse it onto the Abalone 
with a filled tube. Then, starting from the ends of the tube, we push in the 
3-dimensional material of the tube until we get a meridional disc. 

Theorem 2 ([1], [7]) If two 3-manifolds possess homeomorphic special spines, 
then they are homeomorphic (we assume that either both manifolds are closed or 
both have nonempty boundaries). 

In other words, every 3-manifold can be reconstructed from any its special 
spine in a unique way. The reason why the theorem is true consists in a simple 
local structure of special spines. Combining this fact with the above numerical 
description of cell complexes, we get a very economic encoding of 3-manifolds by 
strings of integers. For example, Bing’s House and the Abalone can be presented 
respectively by three and two strings: (1) (2) (1 3 -2 -3 4 2 -4 -1 4 -3) and 
(- 1 ) (12 2 1 - 2 ). 

The dodecahedron space (Poincare homology sphere) can be presented by 
the following six strings describing its minimal special spine: 

(1 3 -9 -2 -6) (3 5 -6 -4 -8) (5 2 -8 -1 -10) 

(2 4 -10 -3 -7) (4 1 -7 -5 -9) (6 7 8 9 10). 

In Fig. 5 (left) we show the 1-dimensional skeleton of the spine. The spine 
can be obtained from the skeleton by attaching six 2-cells along the curves corre- 
sponding to the above six strings. There is a convenient way to present the spine 
graphically by drawing a regular neighborhood of the skeleton in the spine, see 
Fig. 5 (right). The neighborhood is obtained from the 1-dimensional skeleton by 
attaching six annuli. The remaining part of the spine consists of six discs, each 
contained in the interior of the corresponding 2-cell. The spine can be obtained 
from the neighborhood by attaching these discs to all six closed curves in its 
boundary. 

The Poincare sphere is the simplest closed 3-manifold (different from the 
standard sphere S^) whose homology groups coincide with the ones of S^. In 
particular, its first homology group is trivial. The Poincare sphere can be ob- 
tained from the regular dodecahedron by identifying its pentagonal faces such 
that each face is glued with the opposite one along a homeomorphism between 
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them induced by a translation and rotation by 27 t/ 10, see Fig. 6. The images of 
faces of the dodecahedron under gluing form the spine described above. 



Fig. 5. A special spine of the dodecahedron space and its 1-dimensional skeleton 



Fig. 6. A face identification schema for the regular dodecahedron producing the 
Poincare sphere 

Just as the Poincare sphere, any special spine admits a similar graphical 
presentation by a neighborhood of its singular graph. Such presentations for 
Bing’s House with two rooms and the Abalone are shown in Fig. 7. Fig. 8 (left) 
presents another special spine of the 3-ball, a mutant of Bing’s House. It can be 
called a double Abalone, see Fig. 8 (right). 

4 Moves on Special Spines 

Thanks to large freedom in constructing, every 3-manifold possesses many dif- 
ferent special spines. How can one describe them all? The question is important. 




9 
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Fig. 7. Graphical presentations of Bing’s House and the Abalone 




Fig. 8. Double Abalone, a mutant of the Bing House 



since it is related to comparing different numerical codes of the same 3-manifold. 
Let us describe a move T on special spines which is sufficient for the purpose. 

Consider two typical fragments Ai, E 2 of special spines. Ei is a regular neigh- 
borhood of an edge in a special spine. It consists of a “cap” and a “cup” joined 
by a segment, with three attached “wings” (see Fig. 9, to the left). Since Ei 
contains two vertices, it can be considered as a “double butterfly”. E 2 is the 
union of the lateral surface of a cylinder, a middle disc, and three wings (see 
Fig. 9, to the right). Note that the fragments have the same natural boundary 
consisting of two circles joined by three arcs. 





Fig. 9. The T-move 
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Definition. The elementary move T on a special spine P consists in remov- 
ing a fragment E\ <Z P and replacing it by £’ 2 - 

Notice that T increases by one the number of vertices of a spine, while the 
inverse move T~^ decreases it. Also note that we start and finish with special 
spines that have more than one vertex each. 

Theorem 3 ([7], [11]) Let P,Q he two special polyhedra with more than one 
vertices each. Then P, Q are spines of the same 3-manifold if and only if one 
can take P to Q by a finite sequence of moves . 

The proof of the theorem is nontrivial and based on well-known fact that 
any two triangulations of a 3-manifold have a common star subdivision. It is 
worth noticing that, from an algorithmic point of view. Theorem 3 does not 
give us a solution of the homeomorphism problem for 3-manifolds, which is still 
open. Of coarse, given two 3-manifolds Mi, M 2 , we can construct their special 
spines P\,P 2 and attempt to transform Pi to P 2 by a sequence of moves 
Having found at least one such sequence, we can conclude that Mi, M 2 are 
homeomorphic. Nevertheless, this method never tells us that Mi, M 2 are distinct, 
since we are unable to test all possible sequences of moves (there are infinitely 
many of them). 

We remark that moves can be easily realized on the level of numerical 
codes of the spines. For example, to realize the move T, we do the following: 

1. Take the number i presenting a given edge and find in the strings all three 
triples ill i * i *2 consisting of ii and two its neighbors. These triples 
present the wings of Ei. 

2. Find in the strings all six neighboring pairs of the type i*i ii 2 , where ii, *2 ^ 
±i are contained in the above triples. The pairs present the six triangles 
forming the cup and the cap of E\. 

3. Insert a new string of three integers corresponding to the edges of the new 
2-cell which appears under T, and replace the triples for the wings and the 
pairs for the triangles of Ei by the pairs for the wings and the triples for the 
quadrilaterals of £ 2 - 

5 Tabulation of 3-Manifolds 

Recall that the set of all 3-manifolds possesses a natural additive structure: 
given two 3-manifolds Mi, M 2 , one can construct their connected sum Miff M 2 
by removing two open 3-balls Bi C Mi , £2 C M 2 and identifying the two 
boundary spheres arising in this way. For example, any 3-manifold M admits 
trivial decompositions M = MffS^ and M = S^ffM. Manifolds admitting no 
nontrivial decompositions are called prime (with an exception of S^, which is a 
neutral element). It is known that any orientable 3-manifold can be presented as 
a connected sum of prime factors and that the factors are determined in a unique 
way. Therefore the tabulation of 3-manifolds can be reduced to the tabulation 
of prime ones. 

This very much resembles the situation with natural numbers. Indeed, every- 
body knows that any natural number can be decomposed into a product of prime 
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numbers, and the decomposition is unique. Prime manifolds are thus analogous 
to prime numbers and were named after them. 

We define the complexity c{M) of a prime 3-manifold M ^ x to be the 
minimal number of vertices of its special spines. According to this definition, the 
complexity of every 3-manifold is greater than 0 (since any special polyhedron 
must have at least one vertex). On the other hand, there are exactly three prime 
3-manifolds whose natural minimal spines are in some sense simpler than any 
special polyhedron: the projective space x S^, and the lens space 

(the quotient space of 5”^ by a linear action of the cyclic group of order 3). Their 
natural minimal spines are a projective plane, a 2-sphere with an attached arc, 
and a triple hat (the 2-dimensional cell complex presented by the string (11 
1)). The natural minimal spine of 5"^ (a point) is also simpler than any other 
polyhedron. As an exception, we set the complexity of these four manifold to be 
zero. 

It turns out that this complexity is naturally related to all the known methods 
of presenting manifolds and adequately describes complexity of manifolds in 
the informal sense of the expression. In particular, if a 3-manifold M admits a 
triangulation containing n tetrahedra, then c{M) < n. 

Another nice property of the notion is that there are only finitely many 
closed prime manifold of a fixed complexity. To see that, let us show that for 
any integer k the number of special polyhedra with k vertices is finite. First 
we enumerate all regular graphs of degree 4 with k vertices. Clearly, there is 
only a finite number of them. Given a regular graph, we replace each vertex v 
by a butterfly Ey (a copy of the standard vertex singularity) such that the 
four-segment body of Ey coincides with the union of initial segments of edges 
emanating from v, see Fig. 10. For each edge e of G, we glue together the 




Fig. 10. Constructing spines from butterflies 



butterflies corresponding to its endpoints: three wings of the first butterfly are 
glued to the three wings of the second. It can be done in six different ways. 
We get a 2-dimensional polyhedron whose natural boundary consists of circles. 
Attaching 2-cells to the circles, we obtain a special polyhedron. It is clear that all 
special polyhedra with k vertices can be obtained in this way. Not all of them are 
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spines of 3-manifolds. Nevertheless, the resulting finite set of special polyhedra 
contains all special spines with k vertices. 

It follows that for any k there are only finitely many closed prime 3-manifolds 
of complexity k. So it is natural to tabulate 3-manifolds in order of increas- 
ing complexity. The following table had been obtained by means of a com- 
puter [8,6,9]. N{k) is the number of closed orientable prime 3-manifolds of com- 
plexity k. 



k 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


N{k) 


3 


2 


4 


7 


14 


31 


74 


175 


436 


1160 



Let us comment the table. 

1) As we have mentioned above, prime manifolds of complexity 0 are 
RP^, 5'^ X S^, and L34 {S^ is usually considered as a manifold which is not 
prime). 

2) All manifolds of complexity < 5 are Seifert manifolds (see [12]) and have 
finite fundamental groups. They are quotient spaces of by linear actions of 
different finite groups. 

3) The table contains only one nontrivial homology sphere (the Poincare 
sphere, which has complexity 5). 

4) All six flat (i.e., having Euclidean geometric structure) 3-manifolds have 
complexity 6; the torus x x is one of them. Also, the list of manifolds 
of complexity 6 contains representatives of all the four known series of elliptic 
3-manifolds [10]. 

5) All manifolds of complexity < 8 are graph-manifolds of Waldhausen [14]. 
The first two manifolds admitting a complete hyperbolic structure appear on the 
level of complexity 9. One of them has the smallest hyperbolic volume among all 
known closed orientable hyperbolic 3-manifolds [2]. Hyperbolic manifolds with 
the four next values of the volume have complexity 10. 

The computer program enumerating 3-manifolds works in the following way. 
It first looks through all the regular graphs of degree 4 with a given number of 
vertices and, for each graph, lists all the possible gluings together of butterflies 
corresponding to its vertices. Then we attach 2-cells to the boundary circles of 
the resulting 2-dimensional polyhedra. Each special polyhedron P thus obtained 
is tested in the following respects : 

1. Can it be thickened, i.e., is it a spine of a 3-manifold M? 

2. Is M closed and orientable? 

3. Is M prime? 

4. Is P a minimal spine of M? 

5. Is M different from all the manifolds obtained on the previous steps? 
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Having obtained a negative answer to at least one of the questions, the com- 
puter refuses to consider the proposed choice of pasting the butterflies and goes 
to the next version. In the converse case, the result is stored. 

Let us comment how the computer performs the tests. Answers to the first 
two questions can be easily obtained algorithmically. There is also an algorithm 
answering the third question, but it is enormously huge and cannot be imple- 
mented in practice. No complete algorithms for answering the last two questions 
are known, even theoretically. So to answer questions 3-5 we are compelled to 
content ourselves with partial algorithms. 

To answer the questions 3,4, we try to apply to P different combinations of 
simplification moves that preserve M, see [9]. One of the moves is T~^. Other 
moves consist in adding to P one 2-cell, removing another one, and collapsing the 
resulting spine as long as possible. Any simplification move transforms P either 
to a simpler special spine, or to a collection of simpler special spines joined by 
arcs. So spines admitting simplification moves are not interesting: either they 
are non-minimal (in the first case) or they determine reducible manifolds (in the 
second one). 

To answer the last question, we apply two algorithms. The first one tries to 
transform P by successive application of moves to a special spine obtained 
earlier and thus prove that M is not new. The second one compares M with all 
the manifolds obtained on the previous steps by calculating different 3-manifold 
invariants, in particular, by calculating homology groups and Turaev-Viro in- 
variants (see the next section). An important experimental observation is that 
up to the level of complexity 10 the joint work of these algorithms always gives 
a definitive answer: we either prove that M is a new manifold (then we include 
it into the table) or prove that it is a duplicate (then we reject it). 

6 The Turaev-Viro Invariants 

How can one prove that two given 3-manifolds are not homeomorphic? Usually 
this can be done by means of various invariants, i.e., numerical or algebraic 
characteristics of manifolds. We compute the invariants of the manifolds and 
compare them. If they are distinct, then so are the manifolds; the converse is 
not always true. We describe here the so-called state sum, or quantum, invariants 
introduced by V. Turaev and O. Viro. Deep notions of statistical mechanics 
and quantum physics lie behind them ([13]). The construction of the invariants 
proceeds in a number of steps. 

Step 1. Choose an integer iV > 1. 

Step 2. Consider the set integers C = {0, 1, . . . , — 1}. These integers will 

called colors. Let us associate to each color i a real number Wi called its weight. 
We will consider Wi as variables whose values are to be determined. 

Step 3. Consider the set of all butterflies with wings colored by the colors 
from the palette C. Two colored butterflies are said to be equivalent, if there 
is a color preserving homeomorphism between them. The set is finite, since the 
butterfly has 6 wings, and thus we have < colorings. 



70 



Sergei Matveev 



Step 4. To each colored butterfly we assign a real number called the weight 
of the butterfly. There arises the following problem: how to denote colored but- 
terflies and their weights? Note that a colored butterfly is uniquely determined 
by the triple of colors i,j, k of wings adjacent to an edge, and the triple k, I, m 
of colors of the opposite wings (two wings are opposite, if their intersection is 
a vertex rather than an edge, see Fig. 11). For typographical convenience, and 




Fig. 11. Colored butterfly 



following some earlier conventions, such a butterfly will be denoted by (2 x 3)- 



matrix 



and called 



Z 7 A/ \ % 1 k 

, I . The weight of this butterfly is denoted by , 

I mn j I mn 

a quantum (q — 6j)-symbol, for reasons we will not go into here. We will think 

of the symbols as variables whose values are also to be determined. 

Step 5. Let P be a special polyhedron, V (P) the set of its vertices, and C{P) 

the set of its 2-cells. A coloring of P is a map ^ : C'(P) — *■ C. This means that the 

color ^(c) is used for a 2-cell c. Denote by Col{P) the set of all colorings of P. 

It consists of elements, where N is the number of colors and =ffC{P) is 

the number of 2-cells in P. To each coloring ^ gCo1{P) we associate its weight 

w{^) according to the rule 



MO = n 

vev{P) 



i j k 
I m n 



n 

’ cGC(P) 



( 1 ) 



Note that the coloring ^ determines a coloring of a neighborhood of every 
vertex v € V{P). It means that in a neighborhood of v we see a colored butterfly 



J 

m 



k 



with the (g— 6j)-symbol 



i j k 
I m n 



. Every 2-cell c of P is colored by 



V 



the color f(c) having weight ic^(c). Therefore, the right-hand side of the formula 
is the product of all symbols and weights of used colors (with multiplicity) . 
Definition. The weight of a special spine P of P is given by the formula 



w{P) = MO- 

^eCoi{p) 
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Step 6. Of course, the weight of a special polyhedron depends heavily on 
the specific values of color weights and symbols. So we run into the following 
problem: under what conditions is the weight of a spine an invariant of the 
manifold, i. e., does not depend on the choice of a specific spine? To answer, let 
us try to subject the variables to constraints which would insure that the weight 
of a special polyhedron is invariant with respect to T-moves. In order to do that, 
let us write down the following system of equations: 



i j k 
I 771 n 



i j k 
V m' n' 




i m n 
z n! m! 



j I n 
z n' r 



k I m 
zm' V ’ 



(2) 



where f, j, fc, Z, m, n, m! , n', z run over the palette C. 

The geometric meaning of these equations is clear from Fig. 12. Let a special 
polyhedron P 2 be obtained from a special polyhedron Pi by one T-move. Our aim 
is to choose values of weights of the colors and symbols of the colored butterflies 
so that for any Pi,P 2 the equality w{Pi) = w{P 2 ) holds. The left-hand side of 
each equation equals the contribution of the fragment Ei to the weight w(Pi), 
and the right-hand side is equal to the contribution of E 2 to w{P 2 ). We take 
into account only those weights and symbols that occur in both w{Pi),w{P 2 ). 
They correspond to five butterflies (two on the left and three on the right) and 
a unique 2-cell colored by z, which occurs on the right-hand side but not on the 
left one. 




Fig. 12. Graphical interpretation of the equation 



It follows from Theorem 3 that if the weights of colors and symbols of but- 
terflies form a solution of the system, then the weight of a special spine is an 
invariant of the manifold. 

One can easily see that the number of equations is much greater than the 
number of variables. Nevertheless, V. Turaev and O. Viro found solutions for 
arbitrary N. Saying “found” I mean that literally: they found them in a paper 
devoted to a different subject (see [3])! 

Explicit expressions for the solutions are too cumbersome to include them 
here. We restrict ourselves to the simplest case N = 2, when only two colors are 
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used, and describe one of three independent solutions which are possible in this 
case. Let e = (1 + -\/5)/2 or e = (1 — y/b)/2). Then the solution is given by the 



formulas wq 



l,wi = e, 



0 0 0 
0 0 0 



0 0 0 

’111 



= £-1/2, 



oil 

oil 



0 1 1 
1 1 1 



= e 



-1 



and 



111 

111 



= £ 



-2 



From the theoretical point of view, the calculation of invariants causes no 
problems, since the weights and symbols are known. In order to perform it, one 
must exhaust all colorings, calculate their weights by multiplying the weights of 
colors and the symbols of colored butterflies, and sum up all the numbers thus 
obtained. Computer programs based on this approach work well as long as both 
the number of colors and the number of vertices of the spine are about ten or 
less. 



7 Submanifolds of 

In this section we describe the class of compact 3-manifolds which can be em- 
bedded into R^. The class seems to be interesting, since any such manifold M 
can be realized as a union of unit cubes of the standard cubic lattice in R^. One 
can think of the cubes forming the union as being black, and of all other lattice 
cubes as white. Thus we get a presentation of M by a 3-dimensional binary 
picture. 

There is a simple criterion to decide if a given 3-dimensional binary picture 
presents a 3-manifold. We say that two lattice cubes are related by a jump, 
if they share a common 2-dimensional face and have the same color. We will 
distinguish black and white jumps according to the color of the cubes. 

Criterion. A 3-dimensional binary picture represents a 3-dimensional mani- 
fold if and only if any two one-colored cubes can be joined by < 3 jumps of the 
same color. 

The forbidden situations are shown in Fig. 13: 

1. There are two black cubes that have no common edges, but have a common 
vertex incident to six white cubes. 

2. There are two black cubes that have no common 2-dimensional face, but 
have a common edge incident to two white cubes. 

3. There are two white cubes that have no common edges, but have a common 
vertex incident to six black cubes. 

The evident duality between conditions 1 and 3 and the self-duality of condi- 
tion 2 reflects the fact that M is a manifold if and only if so is the background. It 
would be interesting to And similar conditions for n-dimensional binary pictures 
presenting n-dimensional manifolds. 

Let us investigate 3-manifolds in R^ from a purely topological point of view. 
To describe a general way for constructing them, we introduce two basic moves. 
Let M be a compact 3-dimensional submanifold of R^. 
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Fig. 13. Three situations that force a non-manifold binary picture 



Move A. We attach to M a solid tube x [0, 1] that intersects M along 
two discs D'^ X {0, 1} C 9M; 

Move B. We cut off from M a solid tube x [0, 1] C M that intersects 
dM along two discs x {0, 1}. In other words, we drill in M a tubular hole. 

Move C. (Puncturing.) We cut off a 3-ball from the interior of M. 

Note that solid tubes and tubular holes may be knotted and linked, and run 
inside each other, see Fig 14. 




Theorem 4 Any compact 3-dimensional submanifold of can be obtained 
from a 3-ball by a finite sequence of Moves A-C. If the boundary dM of M is 
connected, then M can be obtained from a 3-ball by Moves A, B only. 

The proof is based on the well-known fact that any connected closed surface 
F in is compressible in the following sense: there exists an embedded 

disc D C R^ such that D C\ F = dF and the curve dD does not bound a 
disc in F. I? is called a compressing disc for F, since cutting F along dD and 
attaching two parallel copies of D to the two boundary circles of the resulting 
surface transforms F to a simpler surface F'. Doing that as long as possible, we 
get a collection of disjoint 2-spheres. 

Now, if we have a submanifold M of i?^, we can apply this procedure to the 
boundary dM of M. Each time when the compressing disc is in M, we cut off 
a solid tube, i.e., perform the inverse of Move A. If the compressing disc lies 
in the outside of M, we fill a tubular hole, i.e., perform the inverse of Move B. 
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To finish the argument, we simply go in the inverse direction: instead of cutting 
tubes and filling holes, we drill holes and attach tubes. 

The paper is based on my talk given at the winterschool on Digital and 
Image Geometry at Dagstuhl, December 2000. I thank the organizers for the 
possibility to participate in that very interesting school. The paper is written 
under financial support of RFBR (grant N 99-01-00813), INTAS (project 97- 
808), and the program “Universities of Russia”. 

References 

1. Casler, B. G.: An embedding theorem for connected 3-manifolds with boundary. 
Proc. Amer. Math. Soc. 16(1965), 559-566. 62, 63 

2. Fomenko, A. T., Matveev, S. V.: Isoenergetic surfaces of integrable Hamiltonian 
systems, enumeration of 3-manifolds in order of increasing complexity and volume 
computation of closed hyperbolic 3-manifolds. Usp. Math. Nauk 43(1988), 5-22 
(Russian; English transl. in Russ. Math. Surv. 43 (1988), 3-24. 68 

3. Kirillov, A. N., Reshetikhin, N. Yu: Representations of the algebra Uq{sh), 
q-orthogonal polynomials, and invariants for links. Infinite dimensional Lie algebras 
and groups. Adv. Ser. in Math. Phys. (V. G. Kac, ed.) 7(1988), 285-339. 71 

4. Klette, R.: Cell complexes through time. Proc. Vision Geometry IX, San Diego, 
SPIE-4117(2000), 134-145. 59 

5. Kovalevsky, V. A.: Finite topology as applied to image analysis. Computer Vision, 
Graphics, and Image Processing 46(1989), 141-161. 59 

6. Martelly, B., Petronio, C.: 3-Manifolds having Complexity at Most 9. Geometry 
and Topology, 2001 (to appear). 68 

7. Matveev, S. V.: Transformations of special spines and the Zeeman conjecture. Izv. 
AN SSSR 51(1987), N 5, 1104-1115 (Russian; English transl. in Math. USSR Izv. 
31(1988), N 2, 423-434). 63, 66 

8. Matveev, S. V.: Complexity theory of 3-manifolds. Acta Appl. Math. 19(1990), N 
2, 101-130. 68 

9. Matveev, S. V.: Computer Recognition of Three Manifolds. Experimental Mathe- 
matics. 7(1998), N 2, 153-161. 68, 69 

10. Milnor, J.: Groups which act on S" without fixed points. Amer. J., Math., 79 
(1967), 623-630. 68 

11. Piergallini, R.: Standard moves for standard polyhedra and spines. III Convegno 
Nazionale di Topologia Trieste, 9-12 Giugno 1986(1988), 391-414. 66 

12. Seifert, Y.:Topologie dreidimensionalen gefaserter Raume, Acta Math. 60(1933), 
147-238. 68 

13. Turaev, V. G., Viro, O. Y.: State sum invariants of 3-manifolds and quantum 
6j'-symbol. Topology 31(1992), N 4, 865-902. 69 

14. Waldhausen, F.: Eine Klasse von 3-dimensionalen Mannigfaltigkeiten I, II, Invent. 
Math., 3(1967), 308-333; Invent. Math., 4(1967), 87-117. 68 
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Abstract. In this paper, a sound notion of “continuous” multifunctions 
for discrete spaces is introduced. It turns out that a multifunction is 
continuous (= both upper and lower semi-continuous) if and only if it 
is “strong”, or equivalently it takes (original) neighbours into (induced) 
neighbours w.r.t. Hausdorff metric. Generalizing various previous results, 
we show that any finite contractible graph has the almost fixed point 
property (afpp) for strong multifunctions. We then turn to the fixed 
clique property, or fcp (a desirable strengthening of afpp). We are able 
to establish this property only under much more explicitly geometric as- 
sumptions than in the case of afpp. Specifically, we have to restrict to 
convex-valued multifunctions, over spaces admitting a suitable notion of 
convexity; and, for the best results, to the standard n-dimensional digital 
space with (S”' — l)-adjacency. In this way we obtain what may be con- 
sidered (with reservations, discussed in the Conclusion) as an analogue 
of the classical Kakutani fixed point theorem. 



1 Introduction 

In [17], a natural analog of the e-5 definition of continuous functions for 2n- 
adjacent n-dimensional digital pictures has been studied and investigated. It 
was also shown that any finite (3" — l)-adjacent n-dimensional digital picture 
has the almost fixed point property. In [3] , Boxer continued Rosenfeld’s investi- 
gation for 2n-adjacent n-dimensional digital pictures and examined several im- 
portant classes of continuous functions including homeomorphisms, retractions 
and homotopies. 

In this paper we shall investigate some like topics, but in the more general 
setting of many- valued functions. As indicated in [21], there is a special need for 
multifunctions on discrete spaces in approximating ordinary continuous functions 
on classical (topological) spaces. To be more specific, it is not hard to see that 
multifunctions appear more naturally than single-valued functions in discrete 
(digital) structures. The point may perhaps be made clear via a simple example: 

* The second author has been partially supported by EPSRC grant on “Digital Topol- 
ogy and Geometry: an Axiomatic Approach with Applications to GIS and Spatial 
Reasoning” . 
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consider a continuous function / from unit interval I to I defined by 

_ f 2x, 0 < X < 0.5 

= \2-2x, 0.5<a;<l. 

It is easy to see that during the approximation of /, the discrete rep- 
resentations / and / of / may become multifunctions, i.e. / : —f K 4 , 

00 { 00 , 01 , 10 }, 01 { 01 , 10 , 11 }, 10 { 01 , 10 , 11 } & 11 { 00 , 01 , 10 }; 

and f : Kg ^ Kg, 000 {000,001,010}, 001 {001,010, 011,100}, 

010 {011,100,101,110}, Oil {101,110,111}, 100 {101,110,111}, 

101 {Oil, 100, 101, 110}, 110 {001, 010, on, 100} & 111 {000, 001, 010}. 

Fig. 1 shows the diagrams of / and /. Hence the discrete representations of a 
function may become many-valued rather than single- valued. 



11 

10 

01 

00 



111 

110 

101 

100 

oil 

010 

001 

000 



00 01 



10 11 



000 001 010 oil 100 101 110 111 



Fig. 1. The diagrams of / and / 





As in the cited studies of Rosenfeld and Boxer, we take graphs (perhaps of 
some restricted class) as our underlying model of digital space. In looking for a 
notion of “well-behaved” multifunction between graphs (defined over vertices), 
it is apparent that there is a range of possibilities. Specifically, if / maps vertices 
of graph G to sets of vertices of graph H, we may ask for either a weak or strong 
relation to hold between f{x) and /(y) (assumed non-empty), when x is adjacent 
to y in G: 

Weak there exist a vertex in f{x) and a vertex in /(y) which are adjacent 
(in H). 

Strong every vertex in f{x) is adjacent to some vertex in /(y), and vice versa. 

(There is at least one intermediate possibility as well.) In previous 
work [19,20,21,24] we have shown that the weak relation is appropriate for dig- 
ital approximation of ordinary continuous functions. On the other hand, the 
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strong relation might for example be appropriate for the digital representation 
of smooth motion of bodies. 

In this paper we focus on the “continuous” multifunctions corresponding to 
the strong relation. In Section 2 "we give several alternative characterizations 
of the multifunctions in question, especially in terms of upper and lo'wer semi- 
continuity in closure spaces. Section 3 contains the main result of the paper: all 
finite contractible graphs have the almost fixed point property (afpp) "with re- 
spect to these (multi)functions. This generalizes previous afpp results for single- 
valued functions, including that of Rosenfeld [17]. In the remainder of the paper 
■we compare the afpp with another adaptation (suitable for digital spaces) of the 
fixed point property, namely the fixed clique property (fcp). It turns out that, 
in contrast with the single-valued situation, there is a sharp divergence between 
the afpp and the fcp for multifunctions. We are only able to secure the fcp when 
severe restrictions, of a “geometrical” nature, are placed on both the spaces and 
the (multi)functions in question. 

2 Continuous Multifunctions 

We briefly review the concept of strong multifunctions: by a graph G we mean 
a set X of vertices (or points), with a reflexive and symmetric relation E the 
edges, also known as a tolerance graph in [12]. For graphs Gi and G 2 , a mapping 
/ : Gi — *■ G 2 is a graph homomorphism if / maps vertices of Gi to vertices 
of G 2 , preserving the tolerance relation. By a multifunction / : Gi — *■ G 2 'we 
understand a function that assigns to a vertex x of Gi a non-empty subset /(x) 
in G 2 . 

Definition 1. Let G = (X,E) be a graph. The strong power graph ofG, Vs{G), 
has the non-empty finite subsets of X as vertices, and the edge set Eg, where 

AEgB £ A,3y G B, xEy & Vy G R, 3x G A, xEy. 

Notice that the relation Eg is the conjunction of the two “weak power rela- 
tions” Eo,Ei (in the notation of Brink [4]), defined by: 

AEqB Vx G a, 3y G B, xEy 

AEiB Vy G B, 3x G A, xEy. 

Of course, since we have assumed E to be symmetric, we have E\ = E^^. Some 
remarks on the non-symmetric (digraph) case follow Proposition 3. 

Definition 2. Let G and H be graphs. The multifunction f from G to H , f : 
G H , is strong if regarded as a (single-valued) mapping into Vg(H), it is a 
graph homomorphism. 

Strong multifunctions may be simply characterized as follows: 
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Proposition 1. The multifunction f \ G ^ H is strong if and only if for any 
pair of vertices x,y ofG, we have hH{f{x),f{y)) < 1, where hn is the Hausdorff 
metric induced from the natural graph metric du . □ 

In order to compare this property with the usual continuity conditions for 
multifunctions in topology, it is useful to consider closure spaces [5] : 

Definition 3 (Cech). A closure space is a pair (X, cl), where cl : V{X) 
V{X) is the closure operator which assigns to each subset S C X a subset 01(5) 
called closure of S, satisfies 

(1) cl(0) = 0 

(2) 5'Ccl(5') 

(3) cl(5'UT) =cl(5') Ucl(T). 

A topological space is a closure space in which the closure is idempotent. On 
the other hand, we shall view a graph G = (A, E) as a closure space by taking 
cl as: 

cl(S') = {x € X \ 3y € S, xEy}. 

(It is easy to characterize graphs as closure spaces satisfying certain conditions, 
see [18].) Exactly as in topology, the interior operator of a closure space (A, cl) 
is defined by: 

int(S') = (cl(S'^))^ 

Notice that in a graph (A, E) we have: 

int(5) = {y e A I N{y) C 5}, 

where the {!-) neighbourhood N(y) is {a: S A j xEy}. 

The basic notions of topology generally extend to closure spaces, provided 
we formulate them in terms of interior and closure, rather than in terms of open 
and closed sets. Thus we have the following definition for the standard continuity 
conditions on many-valued functions: 

Definition 4. Let A, Y be closure spaces, and f : X Y a multifunction. 
Then f is said to be lower semi-continuous (Isc) if f{cl{S)) C cl(/(5')) for all 
S C A; and upper semi-continuous (use) i/ /“^(int(T)) C int(/“^(T)) for all 
T<ZY, where f-\T) = {a: € A | f{x) C T}. 

Taking graphs as closure spaces (as above), we then have: 

Proposition 2. Let f : (A, E) (T, E') be a graph multifunction. The follow- 
ing are equivalent: 

(1) f is strong; 

(2) / is Isc; 

(3) / is use. 
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Proof. Notice that, in the case of a graph -with edge relation R, A C cl{B) if 
and only if \/x € A,3y G B,xRy, that is, if and only if ARqB. Thus, lo-wer 
semi-continuity of / reduces to the condition: 

\/x,y G X : xEy f{x)EQf{y). 

By the symmetry of E and E' , this is indeed equivalent to the statement that / 
is strong. Hence (1) (2). 

Similarly, upper semi-continuity of / reduces to 

\/x,y G X : xEy ^ f{x)E[f{y), 

so that, again by symmetry of E and E' , we have (1) (3). □ 

The equivalence of upper and lo'wer semi-continuity is a consequence of sym- 
metry. The preceding definitions and (part of the) proof make sense also for 
reflexive digraphs, and it is easy to see that in that context 'we obtain: 

Proposition 3. A graph multifunction f : G El , where G, El are reflexive 
digraphs, is strong if and only if it is continuous {that is, both upper and lower 
semi- continu ous). □ 

What happens in the case of non-reflexive digraphs? In that case, the cor- 
responding “topological” structures are not closure spaces as ordinarily under- 
stood. The same kind of analysis is still possible, ho'wever: see [18] for a discussion 
of “neighbourhood spaces” in relation to digraphs. 

The “strong” relation Eg has been studied in the algebraic literature as the 
po'wer of a (binary) relation E. It can be generalized to n-ary relations, see 
Brink [4]. Multifunctions based on this and other po'wer constructs (but for 
partial orders rather than relations in general) have been studied extensively 
in domain theory/denotational semantics: see [4,22] for a survey. In [20,21] we 
investigated this topic in the context of graphs and simplicial complexes. 



3 Strong Almost Fixed Point Properties 

Let G = {X, E) be a graph and f : G ^ G a self-mapping multifunction. Then 
we say that a; € X is an almost fixed point of / if x is adjacent to some element of 
/(x). Notice that this condition can be written as xEf{x), if we extend the edge 
relation E to subsets A,B of X by: AEB ^ 3a G A,3h G B, aEh (in effect, the 
“weak” relation mentioned in the Introduction). Let ^ be a property of sets of 
vertices (points) of G. Then G is said to possess the (jy almost fixed point property 
(<))-afpp) if for any multifunction f : G ^ G which sends each vertex of G into 
a subset of G satisfying (f (/ is said to be a ef-multifunction in this case), / has 
an almost fixed point. 

A finite graph G = {X, E) is said to be contractible (or dismantlable) if there 
is a linear ordering < on X, say (xi,X 2 , . . . ,Xm), such that, for all i < m — 1, 
there exists a y differs from Xi, in the subgraph of G induced by {xj G X \ j > i}, 
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satisfying N(xi) C N(y) (in this case, it is said that y dominates Xi, or that Xi 
is dominated by y) . Informally, the idea is that the graph may be contracted (or 
retracted) step by step to a vertex. For the “continuous” treatment in terms of 
homotopy, establishing the connection with topology, see [12,13,16]. 

Theorem 1. Every finite contractible graph has the almost fixed point property 
for strong multifunctions (strong afpp). 

Proof. Let G = (X, E) be a finite contractible graph and / : G — > G a self- 
mapping strong multifunction. Since G is contractible, there exist x,y G X 
such that N(x) C N(y) (Condition 1), and the induced subgraph G' = (X \ 
X, E'), E' = E f] (X \ x X X \ x) is contractible (Condition 2). 

We prove by induction on ffX that / has an almost fixed point. Initial step: 
It is clear that G has the strong afpp when ffX = 1 or 2. Inductive step: We 
assume that G has the strong afpp when ffX < m G IN, and we claim that G 
has the strong afpp when ffX = m-l- 1. Define the multifunction g : G' — *■ G' by 

" I (/(«) \ U M, xGf(u). 

We claim that 5 is a strong multifunction. Indeed, let a,h G X\x such that 
aE'b. We have aEb and hence f(a)Esf(b). Now g(a),g(b) are obtained from 
f(a),f(b) by replacing any occurrences of x by y; since N(x) C N(y), it is 
clear that such a replacement cannot cause E( to fail. That is, f(a)Esf(b) 
g(a)E(g(b). Thus g is strong. 




Therefore by our inductive assumption and Condition 2, there exists a 
z G X\x such that z is an almost fixed point of g, i.e. zE'g(z). Then (since f(z) 
differs from g(z) by, at most, having x in place of y) z must be an almost fixed 
point also of /, except in the following case: y is the only vertex of g(z) adjacent 
to z,xG f(z), but y ^ f(z), see Fig. 2. Since yEz, f(y)Esf{z). So f(y) contains 
a vertex, say w, adjacent to x, hence adjacent to y (•.■ N(x) C N(y)). Thus y is 
an almost fixed point of /. □ 
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Corollary 1. Every finite eontraetible graph has the afpp for single-valued graph 
homomorphisms. □ 

In Euclidean geometry, the “fixed point property” is linked "with Euclidean 
convexity structure by the Kakutani fixed point theorem: in 1941, Kakutani gen- 
eralized Brou'wer’s fixed point theorem for closed n-balls to multifunctions by 
sho'wing that any closed n-ball has the fixed point property for use closed convex- 
valued multifunctions [1,2]. 

At first sight, this seems to have no bearing on -what -we are doing here. Ho'w- 
ever, in the next section "we shall go on from the afpp to the fixed clique property 
(fcp) . We "will find that the fcp fails badly for general continuous multifunctions, 
and that the Kakutani theory gives useful guidance on the conditions that need 
to be imposed to get positive results. 

4 Strong Fixed Clique Properties 

If S' is a set and D = {ci, . . . ,Cfc} is a collection of distinct non-empty subsets 
of S -whose union is S (a cover of S), we say G = (A, E) is the intersection graph 
oi D \i X = D with Ci and Cj adjacent whenever i j and Ci fl Cj yf 0 [8]. 

By the induced graph of an n-dimensional digital space (cellular or grid point 
space), we mean the appropriate neighbourhood graph in the case of a grid, and 
the intersection graph of the closed cells of highest dimension in the case of a 
cell complex (for information on digital topology/geometry and neighbourhood 
graphs, we refer [9,10]). 

It is clear that the induced graph of a finite n-dimensional regular orthogo- 
nal grid (with 2n-adjacency) does not have the afpp. Therefore, for his almost 
fixed point theorem in [17], Rosenfeld allowed also “diagonal” neighbours (giv- 
ing (3" — 1 (-adjacency). In terms of our preferred underlying model, namely 
the regular (finite) n-dimensional cell complex (forming a cube), this amounts 
to working with the intersection graph of the closed n-cells, which we denote 
by K^. 

Definition 5. Let G = (X, E) be a graph, and f : G G a f-multifunction. 
Then a clique G C X is said to be fixed by f if G C f(G). The graph G is said to 
possess the <()-fixed clique property (<()-fcp) if for any (^-multifunction f : G ^ G, 
there exists a clique C of G such that C is fixed by f. 

The notion of (p-icp clearly generalizes the notion of fcp for single-valued 
graph homomorphisms as in [11,12,13,15]. Furthermore, any singleton in the 
graph is a clique: 

Proposition 4. Let G be a graph. Then 

(1) G has the <t>-fcp => G has the fcp for single-valued graph homomorphisms; 

(2) G has the <f-fcp G has the (f-afpp. □ 

Rosenfeld’s almost fixed point theorem can be generalized by 
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Theorem 2. has the fcp for single-valued graph homomorphisms. 

Proof. Since the lattice points (with usual coordinates) of itT” can be listed 
(ordered) by lexicographic ordering <, and it is clear that (A"",<) satisfies the 
hypothesis of contractible graphs, we have: AT" is contractible. The result then 
follows from the fact that every finite contractible graph has the fcp for single- 
valued graph homomorphisms in [12,13]. □ 

The following examples suggest that, in contrast with the single- valued situ- 
ation, some consideration of convexity will be needed in studying the (f>-fcp: 

Example 1. As in Fig. 3, we define / : W 4 —!■ W 4 by a 1 -^ {c, d}, h 1 -^ {a, d}, 
c I— > {a, b}, d 1 -^ {&, c} and e 1 -^ {a, b, c, d}. Then clearly / has no fixed clique. 




Fig. 3. W 4 has no strong fixed clique property 



Example 1 shows by a counter-example that finite contractible graphs have 
no strong fcp for multifunctions. In fact, more is conveyed by it, that is, Helly 
graphs [15] do not have the strong fcp for multifunctions. Note that there is a 
“hole” in the image of e, /(e). 

Let G = {X, E) be a graph, and A a subset of X. Then A is said to be a 
geodesic convex set of G if it contains the set of vertices of any geodesic, i.e. 
shortest path joining any two vertices in A. 

Example 2. As in Fig. 4, it is easy to see that g : A'f — > A'| is a strong geodesic 
convexity multifunction, where AT| is with length 2, i.e., ffK^ = 3. Clearly, g 
does not fix any clique of AT|. 

The last example shows that K",n > 2, does not have the fcp for strong 
geodesic convexity multifunctions. To obtain positive results, we shall need to 
consider other notions of convexity than this geodesic convexity. The following 
is a more-or-less standard notion of convexity structure in a graph: 

A graph convexity [6,7] on a connected graph G = {X,E) is a pair {X,C), 
where C C V{X) is an algebraic closure system of “convex” subsets of X, such 
that 
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1,3, 7, 9 


{5} 
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{6,8} 
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{2,6} 
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{4,8} 


8 


{2,4} 



Fig. 4. g has no fixed clique in 



(1) 0,xec 

(2) Pi € C for any D C 

(3) U T e C for any chain T C C 

(4) C induces a connected subgraph of G for any C gC. 

Suppose no'w that G is finite (and connected). In this case condition (3) 
and the algebraicity property are superfluous. Let M be the collection of all 
m-neighbourhoods, m C IN, of all vertices of G; and let ■(G) be the closure 
under intersection of M. Then -we have: 

Lemma 1. ■(iC") is a graph convexity. □ 

Lemma 1 can be extended to Helly graphs [23]. We may call B(G), the 
neighbourhood convexity of the graph G. For iC", it is clear that ■(iC") contains 
0, iC", any singleton, any vertical, horizontal line and any rectangle of iC": in 
effect, the convex sets are just the rectangular blocks. 

Proposition 5. Let f : iL" — > iL" be a self-mapping strong multifunction map- 
ping each vertex to a neighbourhood convex set of K" . Then there exists a single- 
valued graph homomorphism f : iF" such that, for any vertex x of iF", 

f{x) e /(x). 

/ may be said to be a continuous selection of /. For definition of continuous 
selection on topological spaces, we refer [2]. 

Proof (of Proposition 5). If iF" = {X, E), define / : iF" ^ iF" by 

f{x) = inf(/(x)) 

for any x € X. That is, /(x) = (inf(Pi'i(/(x))), . . . , inf(Pr„(/(x)))), -where Pr^ 
is the projection of iF" onto its factor. 

Clearly, for any x € X, f{x) is unique and non-empty (hence well-defined), 
satisfying /(x) € f{x). We show that / is a graph homomorphism. Indeed, for 
any neighbourhood convex sets A and B, if AEgB, then (inf ( A) ) if (inf (B)): If 
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w € A (in particular, if w = inf(A)), then 3z & B such that Prj( 2 :) — Prj(w) < 1, 
for all j G IN, 1 < j < n. Hence inf(Prj(i?)) — inf(Prj(H)) < 1. By symmetry, 
|inf(Prj(H)) — inf(Prj(_B))| <1. □ 

Proposition 5 may be seen as a special case of digital Michael’s continuous 
selection theorem for the n-dimensional unit cube /" (see also [2]). 

Corollary 2. /tl” has the fcp for strong multifunctions which map each vertex 
to a neighbourhood convex set of K’’ . 

Proof. By Proposition 5 and Theorem 2. □ 



5 A Kakutani Analogon? 

The “block” convexity considered in the last section has some good mathematical 
properties, but is obviously too restrictive for many applications. In this section 
we focus on the digital space iG”, and by taking advantage of the geometry of iG" 
we shall be able to obtain (positive) results for much more liberal and realistic 
notions of convexity than before. 

Let /3(iP") be the boundary of iG", for example, x U 

{1, X and e(iL") = A""'\/3(iP") the core of AT". See Fig. 5 for example 

of and j3{e{K‘^)). For any x = (xi, . . . , a:„) G /3(AT"), there exists an 

unique y = (j/i, . . . , y„) G /3(e(A""-)), namely yi = Xi'iil <i < yi = Xi + 1 

if z = 1 and yi = Xi — 1 if i = #AT^, such that y dominates x, i.e. N{x) C N{y). 
(In the case that = 1, then everything is trivial.) Therefore the mapping 

r : AT" — > £r(AT”),x ^ x ii x ^ f3{K’’-),x i— > y if a; G /3(AT"), is a retraction; we 
shall call it a border retraction (of AT"). 





Fig. 5. /3(AT^) and /3(e(AT^)) 
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We shall -work -with a class RC of “convex” subsets of iC", required to satisfy 
the follo'wing properties: 

(1) RC is closed under border retractions. 

(2) Every A € RC is 2n-connected. Moreover, if x,y € A G RC, then x and y 
can be connected by a 2n-adjacency path tt in A, such that every ver- 
tex (cell) z = (zi, . . . ,z„) of 7T is “coordinatewise” between x and y. that 
is, zi S [xi, j/i], . . . , z„ S [x„, y„] (equivalently, tt is contained in the smallest 
rectangular box which has x, y as vertices). 

For example, if vertices x,y are diagonally adjacent, then {x,y} is not RC- 
convex (because not 2n-connected) . Again, in the 2-dimensional case, if x and y 
lie in the same horizontal or vertical line, and x,y G A G RC, then every vertex 
between x and y is also in A. 

A suitable choice for RC may be arrived at as follows. Recall that for a 
given subset S in the Euclidean space IR", the outer Jordan digitization of S 
(w.r.t. AT"), denoted by is the collection of cells (considered as closed) 

having a non-empty intersection with S [9]. 

Definition 6. Let A he a subset of K". Then A is a digital convex (DC) set 
in if there exists an Euclidean convex set S C IR" such that A is the inter- 
section graph of J^(S). 

Digital convex sets are not closed under taking border retractions, for ex- 
ample, in Fig. 6, the set A is a digital convex set, but r{A) is not. Clearly, any 
digital convex set has the property (2). Moreover, this property will be preserved 
(starting with a digital convex set) under taking border retractions. Thus, we 
may take for RC the class of those sets S such that S can be obtained from a 
digital convex set by a finite sequence of border retractions. 




Fig. 6. A is a digital convex set but r(A) is not 



In the following, we assume the collection RC (of the “convex” sets) to be 
fixed. 
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Theorem 3. has the fcp for stronq multifunctions which map each vertex 
to a RC-set of . 

The following definition and lemma are useful for the proof of Theorem 3: 

Definition 7. The fixed clique C {fixed by f) is said to be irreducible if there is 
no proper subset C of C satisfying C C f{C). 

Lemma 2. Let G = {X, E) be a graph and f : G G a self-mapping multi- 
function. If C C X is an irreducible fixed clique of f, then for any x £ G there 
exists one and only one y € C such that x G f{y). 

Proof. Define a directed graph H = {G, R) by 

uRv = V G f{u) 

for all u,v G G. Clearly any cycle in H determines a fixed set (clique) of /; 
since G is irreducible, every cycle is Hamiltonian. It is then easy to see that each 
vertex of H has an unique predecessor. Indeed C consists of a single cycle, and 
the function f : G ^ G obtained by restricting and corestricting / to C is a 
bijection. □ 

Proof (of Theorem 3). The following basic fact is used in the remainder of the 
proof {E is the edge relation of K"): 

(•) If A, B are any two subsets of it!" such that AEgB, then r{A)Egr{B) (be- 
cause the composition of two strong multifunctions is a strong multifunction, 
see [20,21]; and every single- valued graph homomorphism is a strong multi- 
function) . 

Let / : iL" — > iL" be any self-mapping strong RC- multifunction, that is, a 
strong multifunction which maps each vertex of RT" to a RC-set of RT" . We show 
that / has a fixed clique by using induction on the length of RT", denoted by 
£(RT"). Initial step: It is clear that RT" has the fcp for strong RC-multifunctions 
when f{K'^) = 0 or 1. Inductive step: We assume that R'" has the fcp for strong 
RC-multifunctions when £{K'^) < m — 1 G IN, and we claim that RT" has the 
fcp for strong RC-multifunctions when £(RT") = m. Define the multifunction 
g : £{K'^) e{K'^) by 

g{x) = (rof){x). 

By the “basic fact” (•) of RT", it is easy to see that 5 is a strong RC- 
multifunction. Therefore by our inductive assumption, g has a fixed clique G C 
er(R'"). W.L.O.G., we may assume that C is irreducible. We consider the case 
that C n /3(e(R'”)) yf 0 (otherwise, we are done): 

Case 1. ffG = 1: If C = {x}, we consider the case that x ^ f{x) only. Clearly, 
X G f3(s(K")), therefore there is a dominated vertex y G /3(RT") of x, such 
that y G f{x). Since / is strong, and xEy, there exists & z G f{y) such that 
zEy (and hence zEx). Again, since {x,y,z} is a clique, and / is strong, we 
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claim that there exists a, w G f{z) such that {cc, y, z, tc} is a clique. Indeed, 
since / is strong, there exist W\,W 2 € f{z) such that wiEy (hence w\Ex) 
and W 2 EZ. Then by the property of RC-sets, there exists a w € f{z) such 
that {x, y, z, tc} is a clique. Finally, since iC" is finite, then by repeating this 
procedure, there must exist a fixed clique (containing x or not) of /. 

Case 2. C C /3(e(iC")) -with yf 1: This can be dealt -with by an extension 
of the argument for the Case 1; details of the proof are omitted here. 

Case 3. Let x, y be vertices in C such that y G g{x). We only need to consider 
the case that y G /3(e(RT”)). If y G f{x), then nothing needs to be shown. 
Hence we suppose y ^ f{x), that is, there is a dominated vertex z G P{K^) 
of y, such that z G f{x). Since / is strong, there exists some vertex v G 
C n e(e(iF”)), u G f{u) for some u G C, such that vEf{x). Hence there 
exists w G M, where M is the maximal clique containing C, such that 
w G f{x) (otherwise, this would violate the structure of f{x) since f{x) is a 
RC-set). Since iF" is finite, then by repeating the procedure described above, 
we conclude that there must exist a fixed clique (CM). □ 

Corollary 3. iF" has the fcp for strong multifunctions which map each vertex 
to a digital convex set of K^. □ 

6 Conclusion 

Taking our cue from Pultr [13], we might describe Theorem 3 as an “analogon” 
of the Kakutani fixed point theorem. In [21] we proposed a precise criterion for 
saying that a given digital theorem is a version of a classical (continuous) the- 
orem. That criterion requires, in particular, that the classical theorem can be 
recovered by a limiting process from the digital one. This is by no means the 
case for Theorem 3 in relation to Kakutani’s Theorem. For an adequate digital 
version, in the strict sense, we need to work with weak rather than strong multi- 
functions: see [21]. In this paper we have chosen to study strong multifunctions; 
one of the advantages of this choice is that, unlike in [20,21], we are able to give 
purely combinatorial proofs for all our results. 

It is our belief that, in developing digital topology /geometry, single- valued 
functions are unduly restrictive. They are not flexible enough: no “stretching” is 
possible with them. (Poston [12] writes of “chain mail geometry”.) This circum- 
stance, we conjecture, explains why relatively little has been done with functions. 
We hope that multifunctions will be developed and used more in the future. 
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Abstract. The aim of this paper is to present the design of a spatial 
modeling tool, called SpaMod, that is currently developed in Poitiers 
(France). SpaMod will allow users to represent and manipulate both 
discrete and continuous representations of geometrical objects. It is a 
topology based geometric modeling tool with three types of embeddings: 
the classical Euclidean embedding, the discrete matrix embedding (voxel 
or inter-pixel) and the discrete analytical embedding (discrete objects de- 
fined by inequations). In order for such a tool to fulfill its role, all three 
embeddings have to be available together. Conversions between embed- 
dings is thus an important however, in 3D, still partially open question. 

Keywords: Modeler, Discrete, Reconstruction, n-G-map 



1 Introduction 

With the present paper we undertake an investigation aimed at proposing a 
software platform, called SpaMod that stands for Spatial Modeling Tool, that 
will allow us to represent and manipulate both discrete and continuous represen- 
tations of geometrical objects. For fourty years software plateforms have been 
developed to create and manipulate computer models that aim at reproducing 
real world objects and simulating real world phenomena. They are unavoidable 
in any industrial manufacturing process. The objects that are manufactured 
nowadays are almost all designed with the help of some modeling tools, may 
it be a screw or an airplane. However, most available softwares are only able 
to handle either discrete (medical imaging, image processing libraries, etc.) or 
continuous (CAD tools for example) objects. There is an important advantage 
at disposing of both representations as one can mix synthetic and acquired data 
sets. With SpaMod, we want to be able to create new discrete or continuous 
objects, to import discrete objects obtained from direct acquisition (numerical 
photography, magnetic resonance imaging, pet scan, etc.) or import continuous 
objects obtained from classical modelers. Most importantly, we want to repre- 
sent objects both in discrete and in continuous forms and be able to update all 
the representations in case of any modification. 

We have chosen to include discrete geometry into classical Euclidean 3D mod- 
eling methods, and more precisely topology-based geometric modeling methods 
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often used for Boundary Representation. Another type of modeling method has 
been chosen by Kaufman [17] that has designed a Constructive Solid Geome- 
try (CSC) type tool. The principal difference between Kaufmans’ approach and 
ours’ consists in the definition of the kernel. He considers only discrete object 
as collections of elementary elements (voxels) and performs boolean operations 
on these sets. In our approach, we consider an object as a topological structure 
embedded in a discrete or/and a continuous space. The considered operations 
are two fold: topological operations that modify the topological structure of the 
objects and embedding operations that modify their geometrical shapes. This al- 
lows us multiple different embeddings (especially both discrete and continuous) 
for a same topological entity. 

The topological layer we chose for SpaMod is based on a combinatorial model 
called chain of maps [14]. This topological model allows a very large class of 
topological objects (manifold, non-manifold, oriented, non oriented, bounded, 
unbounded, etc.) as SpaMod is not, for the moment, oriented towards a specific 
application. The topological model can be restricted afterwards relatively easily. 

Three types of embeddings are proposed in SpaMod. Each of which can 
be declined in several different forms. These three embedding types are the 
classical Euclidean embedding, the classical discrete matrix embedding and the 
new discrete analytical embedding. An analytical discrete object is a discrete 
object defined by inequations rather than as a set of discrete points. The purpose 
of this last embedding type was at first just to enable the representation of 
objects such as, for instance, discrete 3D planes of Reveilles [29]. This allows the 
integration of recognition algorithms such as the ones of I. Debled-Renesson [12], 
L. Papier [26,27] or J. Vittone [35]. It appears now that this embedding type is 
of fundamental importance because it represents a compact way of representing 
discrete data and it is an alternative (to the Marching Cube [24] for instance), 
much more compact, way of reconstructing Euclidean data sets out of discrete 
data. 

The software SpaMod we are designing is far from being finished and there 
are many problems that remain open. In this paper we explain the architecture 
of SpaMod. We address the fundamental choices we made and explain the steps 
that have to be achieved to reach the goals we have set up. We show that the 
open problems that this project has brought up are fundamental to the com- 
munity and that solving them will address applications that go far beyond this 
modeler. This modeler is also intended at being a development environment for 
the computer graphics and, more precisely, for the discrete geometry community 
for implementing and testing algorithms. 

In section 2 we present the kernel of SpaMod. We detail the two layers of 
the kernel: the topological layer and the embeddings layer. We first explain the 
topological model, the discrete analytical model and the discrete matrix model. 
Then, we end this section by explaining how the different embeddings are linked 
to the topological model. In section 3, we present the conversions between the 
different embeddings proposed in SpaMod. Then, we will eventually conclude in 
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section 4 with some open problems and what remains to be done to achieve this 
project. 



2 SpaMod’s Kernel 

The central scheme of SpaMod is organized in two layers: the topological layer 
and the embedding layer. The embedding layer is itself subdivided into three 
different types of embedding: the Euclidian embedding, the discrete analytical 
embedding and the discrete matrix embedding (Fig. 1). In figure 1, the parts in 
grey represent a classical Euclidean topology based geometrical modeling tool. As 
we have already explained, our aim is not to create a discrete-only modeling tool 
but to propose a modeler that is able to handle different types of embedding. 
The modeler is designed in such a way that the discrete embeddings can be 
added as extensions on already existing modeling softwares. Let us examine 
each component of the diagram. 




Fig. 1. Topological layer and embeddings layer 



2.1 Topological Layer 

The topological layer is a central element of the modelers’ kernel as it handles 
the topology of the objects. The different embeddings are attached onto the 
topological structure. The way this attachement is organized is presented in sec- 
tion 2.3. This layer is based on a combinatorial model that describes the objects’ 
subdivision, i.e. the set of i-cells, for i = 0,1, 2, 3 (vertex, edge, face, volume) 
and the border relationships between these cells. Different topological models 
have been proposed in [23,21,20]. In order to be able to propose a wide variety 
of objects, we have selected the chain of maps model [14]. This model is able to 
handle orientable and non-orient able, bounded and non-boimded, manifold and 
non-manifold topological objects. 
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Although the structure used is cellular, it is based on simplicial structures 
(simplicial sets) [21]. However we use cellular structures since some operations 
on a simplicial structure do not always lead to another simplicial structure, or 
at least, it would be very time-consuming. The choice of the model of chains of 
maps rests on several studies. For instance, Brisson compared this model with 
the ClF-complex one [11] and Lienhardt compared the data structures used here 
with the ones used in Boundary Representation [23]. 

SpaMod is for the moment not dedicated to specific applications and the 
topological model that we chose is therefore as large as possible to avoid limi- 
tations. The topological model can however easily be tailored down for specific 
uses as it is written in object-oriented C-| — h programming language. 

The n-chain of maps [14] explicitely describes each i-cell of the subdivision 
by using an i-dimensional generalized map. We are going to present briefly the 
basics of our topological model by providing the definition of an n-dimensional 
generalized map (or n-G-map) (Def. 1) and the notion of orbits (Def. 2). We will 
then explain the model we use. 

Definition 1. (n-G-map). An n-G-map, n gN, is defined by an [n-\-2)-tuple 
G = {D, aoi ■ ■ • ; «n); sueh that: 

— D is a finite set of darts, 

— Oq, . . . , Ora o.f'e permutations on D, such that: 

• for 0 < i < n, at is an involution^ , 

• for 0<i<i-\-2<j<n, ai o aj is an involution. 

Figure 2 illustrates an example of a 2-G-map. It shows two 2-simplices (or 
2-cells), where each 2-simplex is composed of six darts linked by ag and ai. Note 
that 2-dimensional darts, i.e. darts of 2-G-maps, will be called 2-darts. A 2-dart 
can also be linked by «2 when two 2-cells share a common edge (Fig. 2). 

Definition 2. (Orbit). Let d G D and <P a subset of the permutations 
{ao, . . . ,an}. Let us assume that (<P) is the subgroup of permutations gener- 
ated by <L. Then, the orbit of a dart d is {'T){d) = {^(d),(/) G (^)}- If^ is empty, 
then {d^){d) = {d}. 

In other words, an orbit is a set of darts representing a topological entity. 
Thus, orbits can be considered as the link between topological cells and their geo- 
metrical embeddings. Precisely, an i-cell is an orbit for 

{ofc : 0 < k < n, k ^ i}. For instance (Fig. 2), to describe the vertex Vi, 
i.e. 0-cell, whose topological representation is encircled with the dashed ellipse, 
we have to consider one of the darts in this ellipse, and apply a\, a 2 and 02 ° 0:1 
to it. You can notice that the starting dart does not matter. 

When observing the topological view of figure 2, we can easily associate each 
orbit with their corresponding geometrical entity, as summerized in table 1. 

Although the n-G-map model is powerful, we use another model in SpaMod: 
the n-chain of maps. Among all its assets, this model allows the modelisation 
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Fig. 2. On the left, a classical view of two 2D triangles T\ and T 2 sharing a 
common edge. On the right, their topological modelisation. The set of darts is 
D = {a, 6, c, d, e, /, g, h, z, j, fc, /} 

Table 1. Associations orbit/geometrical entity 



Geometrical entity 


Orbit 


vertex Vi 


{ai,a2){k) = (k,l,a,b) 


vertex V2 


(ai,a2)(c) = (c, d) 


vertex V3 


(ai,a2)(e) = (e,f,g,h) 


vertex V4 


(ai,a2>(i) = (i,j) 


edge ei 


(00,02) (6) = (b,c) 


edge 62 


(Qo,02)(d) = (d, e) 


edge 63 


(oo,02)(h) = {h,i) 


edge 64 


(oo,02)(i) = (i,fc) 


edge 65 


(oo,02)(a) = {a,f,g,l) 


triangle Ti 


(oo,oi)(a) = (a,b,c,d,e,f) 


triangle T2 


(00, 01) (5) = {g,h,i,j,k,l) 



of non-manifold objects. For instance, a tetrahedron and three edges sharing 
a common vertex (Fig. 8) is non-manifold and cannot be represented with the 
n-G-map model. We won’t recall the definition of this model as it will be too 
long, but an interested reader will find all the details in [14]. 

To conclude, we would like to stress on our choice of these models for SpaMod. 
The main reason is that they are defined for n-dimensional objects of any kind 
and therefore, they allow high modeling variety. They are also coherent as they 
are based on a unique elementary object: the dart. Moreover, the constraints 
make sure that performing any operation on an object will result in a correctly 
modeled object. 



96 



Eric Andres et al. 



2.2 Embeddings Layer 

The embedding layer decribes the embedding of each topological subdivision 
into a geometrical space, i.e. the spatial position and form of the object. Due to 
the specification of SpaMod, we propose three different types of embeddings to 
work with (Fig. 1). We present those three types embeddings in this section. 



Euclidean Embedding The first embedding model used is the classical Eu- 
clidean model. An Euclidean object is represented by real number coordinate 
points for each vertex of the topological model. The edges, faces and volumes of 
the modeled objects are interpolated from their vertices. The topological layer 
and the Euclidean embedding form a classical Euclidean topology based model- 
ing tool with all the classical operations that exist in such softwares (in grey on 
figure 1). We need this type of embeddings in order to be able to import Euclidian 
models (VRML objects for instance) for later discretization or as result of recon- 
struction algorithms (such as Marching Cube ones). SpaMod is therefore not a 
discrete-only modeling tool but a classical topology-based geometrical modeling 
tool with two extra types of embeddings that we will describe just next. For the 
moment we have limited ourself to a polygonal Euclidean embedding because, to 
the best authors’ knowledge, no convincing general discrete counterpart models 
for Beziers, splines, . . . curves and surfaces have been proposed so far. 



Discrete Analytical Embedding The aim of this representation form is to 
represent a discrete object by an analytical description, i.e. a set of analytical 
inequations. This type of representation is by definition independent of the num- 
ber of discrete points composing a discrete object. This type of embedding is new 
and central to our purpose. The aim is to be able to handle discrete analytical 
objects such as discrete lines and planes of Reveilles [28] , circles and spheres of 
Andres [1], supercover [5,2], standard [2], naive [2], ... models. Many papers 
have dealt these last decades with 2D discrete analytical straight line [16,12,32] 
and 3D plane segment [12,13,37,18,31,32,36] recognition algorithms. Our aim is 
to propose a platform where such algorithms can be developed, implemented, 
tested and finally used in industrial applications. We have implemented two 
discrete analytical embedding models so far: the supercover and the standard 
model. These two models are both discrete analytical models and geometrically 
coherent. 

Definition 3. A discrete model is said to be geometrically coherent if the dis- 
cretization of an Euclidean object A depends only on A. 

This means, for instance, that the discretization of a 3D straight line seg- 
ment D is the same if D is just an isolated 3D straight line segment, the edge of 
a 3D triangle, or even the edge of a 3D tetrahedron. This notion is not verified 
by most discrete models such as, for example, the discrete primitives proposed 
by Cohen and Kaufman [17]. 
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In SpaMod we have implemented two different discrete analytical models so 
far; the supercover model and the standard model. Let us present briefly the 
supercover model. 

Definition 4. The supercover S (E) C Z" of a general Euclidean object E C R" 
can be defined in several equivalent manners: 



where V (p) is a voxel centered on p, doo is the block distance, ® represents the 
Minkowsky sum. 

We will not detail the supercover model. This would go beyond the scope of 
this paper. The reader can find extended details on this model in [2,5]. Let us 
just state that, intuitively, the supercover of a Euclidean object E is composed 
of all the discrete points whose voxels are in contact with E. The supercover 
model is by definition geometrically coherent since the supercover of an object 
is independent of it being part or not of another geometrical structure. The 
supercover of a vertex (resp. edge) of a triangle is, by definition, the supercover 
of a point (resp. straight line segment). 

The supercover is also an analytical discrete model. It has been shown that 
the supercover of any semi-algebraic set can be described analytically [33]. Let 
us just examine, as an example, the supercover of the triangle Ti of figure 2 
(see Fig. 4 in section 2.3). The supercover of the triangle Ti is given by seven 
inequations (in this example, the equations number 17, 5, 6, 20, 11, 22 and 4) 
and this independently of its size. If one or two edges would have been parallel 
to an axis then only six or even five inequations would have been necessary to 
describe the supercover of Ti. In SpaMod, the representation of the supercover 
of a convex set corresponds simply to a set of inequations. A concave object is 
defined as unions of convex shaped objects (i.e. as union of simplices). 

One drawback with the supercover model is that it has sometimes 
bubbles. For instance, the supercover of the 2D Euclidean point (5,5) is the 
set of points {(0, 0) , (0, 1) , (1, 0) , (1, 1)}. The fact that a Euclidean point has 
four discrete points as embedding can raise some problems in some algorithms, 
especially those aimed at recognition. To avoid this problem we have introduced 
the standard discrete analytical model [2] . The standard model is obtained from 
the supercover model by changing some inequations of the supercover represen- 
tation to strict inequations. When this is done in a correct way (see [2] for more 
details) then we obtain objects without bubbles. The standard model of an Eu- 
clidean point is always an unique discrete point, the standard model of a 2D line 
is always 4-connected without simple points, a standard 3D line and 3D-plane 
is always 6-connected without simple points. This is proved in [9]. This means 
that the standard model is especially well adapted to the inter-pixel discrete 
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Discrete Matrix Embedding The model we choose for a discrete matrix 
embedding is of course the same as for the discrete analytical embedding. Both 
embedding types represent the same object. The classical way of representing a 
discrete object is to enumerate its points. This set of discrete points is contained 
in a finite matrix that represents a finite part of space. The discrete matrix in 
SpaMod embedding is proposed in two forms: 

— The first form is the classical voxel form. A discrete object is formed of voxels 
corresponding each to a discrete point in a matrix. 

— The second form is the Khalimsky-Kovaleski (K^) representation form [15], 
also called inter-pixel or abstract cell complexe representation form. A voxel 
is not represented as such, instead the voxels’ vertices, edges, faces and its 
volume are each represented explicitely. The discrete space can also easily 
be stored in a matrix. The discrete space representation has many inter- 
esting properties and will probably, in authors’ opinion, become prevalent 
over the classical voxel representation in applications. Several questions of 
course need to be addressed such as, for example: Do we just represent the 
inter-pixel border of an object or its complete cell subdivision? 

Example To conclude this section, let us consider the topological triangles T\ 
and T 2 of figure 2. The Euclidean embedding will simply consist in the four vertex 
coordinates (0,0), (3, 8), (9,1) and (12,10). The discrete matrix embedding for 
the supercover model will consist of all the (dark and light) grey points in the 
following matrix (Fig. 3). With the standard model, the three grey points marked 
with a circle need a closer look. The two points which are in the outline of T\ 
and T 2 do not belong to the triangles as they do not belong to the standard 
straight line segments (0, 0) — (9, 1). But the marked point that is on the common 
edge belongs to the triangles as it belongs to the interior of T 2 ■ 




D interior of the discrete triangle 
n discrete edge 
B discrete vertex 

0 points that belong to the supercover 
but not to the standard triangle 



Fig. 3. Discrete matrix embedding 
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2.3 Link between the Topological Layer and the Discrete Analytical 
Embedding 

We are now going to explain on an example how the link between the topological 
layer and the embedding layer is established. Then we will explain where the 
discrete analytical information is stored. 

As we have seen in the previous section, an analytical object is defined by a 
set of inequations that describe its geometrical embedding. 

It would have been a great loss of space to store those inequations in each 
dart of the corresponding orbit. That is why this information is only held by 
one dart. To obtain the geometrical embedding of an f-cell, we have to randomly 
pick up a dart of that cell and search for this special dart all over its orbit. 
A dart can contain the information related to several objects, i.e. it can hold the 
inequations of a vertex and the ones of an edge if it belongs to the two orbits. 

Let us focus on our current example, the two 2D triangles T\ and T 2 of fig- 
ure 2. They obviously share the edge 65 . Figure 2 shows the classical Euclidean 
embedded view and the topological representation of T\ and T 2 . Figure 4 il- 
lustrates the analytical view of those triangles, and shows the 26 inequations 
related to that representation. 



5 




Fig. 4. An analytical representation of T\ and T 2 with inequations numbered 
from 1 to 26. The outline is thickened 



The supercover analytical representation of these triangles is defined by the 
set of inequations given in table 2. Figure 3 illustrates all the discrete points 
verifying the inequations of table 2 . 

The eight bold inequations in table 2 define the global object (thickened 
outline of figure 4) made of T\ and T 2 . We need however all the other inequations 
too. For instance, if we want to perform an operation on T\ alone, we need 
Eqii and Eq 22 - 

The information is not duplicated. Each simplex only holds the inequations 
that are peculiar to it, and holds pointers to the others of lower dimensions. 
Table 3 illustrates that. 
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Table 2. The 26 inequations that describe Ti and T 2 



{Eq^) 


y - 17/2 < 0 


(Ega) 


a; - 7/2 < 0 


{Eqa) 


-2/ + 15/2 < 0 


{Eqi) 


-x + 5/2 < 0 


(Eqs) 


y- 21/2 < 0 


(Eqe) 


X- 25/2 < 0 


{Eqi) 


-2/ + 19/2 < 0 


{Eqs) 


-x + 23/2 < 0 


{Eqg) 


2/- 3/2 < 0 


{Eqio) 


X- 19/2 < 0 


{Eqii) 


-y -t 1/2 < 0 


(Egia) 


-a; -t 17/2 < 0 


(Eqii) 


2/ - 1/2 < 0 


{Eqii) 


a; - 1/2 < 0 


(Eqis) 


-y-l/2<0 


(Eqie) 


-X- 1/2 < 0 


(Eqir) 


-2x -1- 9y - 143/2 < 0 


{Eqis) 


2a;- 92/ -t 121/2 < 0 


(Eqis) 


—9a; -|- 32/ — 84 < 0 


(Eqao) 


9x - 3y -1- 72 < 0 


(Eq2i) 


7x + Gy- 151/2 < 0 


(Egaa) 


-7x -6y + 125/2 < 0 


(E 523 ) 


—X -|- 9// — 5 < 0 


(Eq24) 


X - 9y - 5 < 0 


{Eq2b) 


8a; -32/- 11/2 < 0 


(Eq2e) 


-8x-|- 3y - 11/2 < 0 



In table 3, Eqi {I < i < 26) is the i-th inequation (given in table 2) and 
t Eqi (1 < i < 26) denotes a pointer (in a programming view) to the i-th 
inequation. For instance, the discrete analytical embedding of triangle Ti is held 
by dart a and is composed of 7 pointers to the actual inequations. The discrete 
analytical embedding of edge 65 is also held by dart a (for example) and is 
composed of 2 inequations (the supercover line) Eq 2 i and Eq 22 , and links to the 
4 other inequations (belonging to the edge). 



Table 3. One possibility to store the inequations 



Geometrical entity 


Dart 


Inequations held 


vertex Vi 


k 


Eqi,Eq2,Eq3, Eqi 


vertex V2 


c 


Eqs, Eqe, Eqi, Eqs 


vertex V3 


e 


Eqe, Eqio, Equ, Eqi 2 


vertex V4 


i 


Eqi3, Eqii, Eqis, Eqie 


edge ei 


b 


Eqii, Eqis, )Eq 3 , 'lEqi, )Eqs, ]Eqe 


edge 62 


d 


Eqie, Eqeo, "{Eqs, "{Eqe, "{Equ, /Egia 


edge 63 


h 


Eq23, Eq 2 i, "{Eqe, /Egio, '{Eqis, '{Eqie 


edge 64 


j 


Eqes , Eqee , T Eqi , / Eqe , T Eqis , T Eqie 


edge 65 


a 


Eqei, Eq 22 , "{Eqi, "{Eqi, "{Eqio, "{Equ 


triangle Ti 


a 


{Eqi, "{Eqs, "{Eqe, "{Equ, "{Equ, {Eqeo, "{Eqee 


triangle T 2 


9 


T Eqi , { Eqio , T Eqis , T Eqie , T Eqei , T Eqei , T Eqee 
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3 Embedding Conversions 

One of the first reasons why the implementation of embedding conversions oper- 
ations are necessary is the fact that we want to be able to import objects created 
or acquired externally. When an object is imported, we need to construct the 
other, remaining, embeddings and, if necessary, a topological representation of 
the object. We propose three types of embeddings in SpaMod, thus we distin- 
guish six different conversions between embeddings as shown in figure 5. Let us 
examine these different types of conversions. 




Euclidean object import discrete object acquisition 



Fig. 5. Embedding conversions 



3.1 From Euclidean to Discrete Analytical Embedding 

To perform such a conversion is easy. We just need to compute the inequations 
given by a discrete analytical model, like the supercover one [5,7,6]. 



3.2 From Discrete Analytical to Discrete Matrix Embedding 

Again, this conversion is not much of a problem. Indeed, from a discrete ana- 
lytical representation it follows that all the points whose coordinates verify the 
inequations belongs to the discrete object. So, we can use a brute-force algorithm 
and check all the points to find those that match the set of inequations. This is 
of course not very efficient but it shows that this operation is not a theoretical 
problem. In SpaMod we generate discrete objects in the matrix representation 
by incremental algorithms, which are far more efficient. 
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3.3 Prom Euclidean to Discrete Matrix Embedding 

For a given discrete model, this conversion is usually not a problem since all 
the discrete models propose algorithms to generate discrete objects. That is also 
what we do for the supercover and the standard model. 



3.4 Prom Discrete Matrix to Euclidean Embedding 

To convert directly a discrete object, that is a set of voxels, in an Euclidean 
object is possible and several methods have been proposed, such as the March- 
ing Cubes algorithm [24]. The problem with these methods is that the number 
of polygons of the resulting Euclidean embedding is proportional to the size of 
the original discrete matrix object. This is not a compact representation of the 
Euclidean object and often requires additional steps of simplification in order to 
reduce the number of Euclidean polygons. In this case the discrete matrix object 
and the Euclidean object do not represent the same object anymore. That is why 
we propose a new alternative two step process described below. 



3.5 Prom Discrete Matrix to Discrete Analytical Embedding 

We called this operation analytical recognition^ as it permits to find the inequa- 
tions that are verified by the voxels of the discrete matrix object. This is the first 
step of the “discrete to Euclidean” analytical transformation process. This oper- 
ation has been partially studied by I. Debled-Renesson [12], L. Papier [25], J. Vit- 
tone [37], etc. The 2D case has been sucessfully solved by I. Debled-Rennesson. 
The 3D case remains however an open problem. For the moment, given a set of 
voxels, we can compute the features of the analytical plane which they belong 
to. We now want to describe totally the edges and vertices of that plane piece. 
There are several research leads. We are trying to adapt the naive objects recog- 
nition algorithm of Vittone [35] to our needs. 



3.6 From Discrete Analytical to Euclidean Embedding 

Once the inequations have been found, there are generally several Euclidean ob- 
jects that fit those inequations. That is to say, for instance, several Euclidean 
edges can result in the same discretized edge. So the reverse operation is a serious 
issue. This so-called analytical reconstruction step, the second one of the “dis- 
crete to Euclidean” analytical process, consists in finding the Euclidean object 
that fits the best. For instance, given a discrete edge, we obtain its analytical 
representation and then we choose an Euclidean edge amongst the possible ones 
(Fig. 6). There is no problem when, for instance, an edge of a triangle is not 
attached to another one. In case we have a 3D discrete edge that is part of two 
discrete triangles then we need to choose an Euclidean edge that will be part 
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of both 3D Euclidean triangles (i.e. planes). This is not easy and we have seen 
in the 2D case that there is not always a solution (Fig. 7). There are still many 
open problems here. 




Fig. 6. A discrete edge and two Euclidean edges that match 




Fig. 7. A discrete triangle and two Euclidean triangles that match 



4 Conclusion 

In this paper we have presented the architecture of a modeling tool that works 
in a topological space, in the Euclidean space and in discrete space. The soft- 
ware, called Spamod for Spatial Modeling Tool, is currently in development in 
our laboratory. We have presented the different layers and explained how we 
implemented the different embeddings. We propose three types of embeddings 
in SpaMod. The Euclidean embedding, the discrete matrix embedding (discrete 
information is stored in a matrix) and the discrete analytical embedding (dis- 
crete objects are represented by inequations). See figure 8 for an example of 
each view. The discrete analytical embedding provides a compact representa- 
tion form for discrete information. We have implemented two coherent discrete 
models that provide analytical descriptions of objects: the supercover and the 
standard model. This software is intended as discrete object creation and ma- 
nipulation tool and algorithm development platform for the discrete geometry 
community. SpaMod serves also as an important research tool. The software 
is far from finished. We have still to solve some difficult open problems. One 
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of these very important open problems is the analytical recognition of discrete 
polygons. Different research groups work already with different approaches on 
this problem. The problem is to describe a piece of a discrete analytical plane 
as a discrete 3D polygon. Solving this problem will provide a compact repre- 
sentation of discrete object and several important geometrical properties such 
as normal vectors. From the discrete polygons one can construct corresponding 
Euclidean polygons. This reconstruction process, once realised, offers compact 
Euclidean reconstructed objects out of discrete objects. This method could re- 
place the Marching Cube method efficiently. This research does not seem to be 
irrealistic to achieve as it has already been solved in 2D in medical imaging 
application [4]. 
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Fig. 8. Four screen captures, from a SpaMod session, of the same object (a tetra- 
hedron and three edges): (a) the classical view, (b) the topological modelisation, 
(c) the discrete analytical representation, (d) the discrete matrix representation 
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Abstract. A pyramid is a stack of image representations with decreas- 
ing resolution. Many image processing algorithms run on this hierarchical 
structure in C)(log(n)) parallel processing steps where n is the diameter of 
the input image. Graph pyramids are made of a stack of successively re- 
duced graphs embedded in the plane. Such pyramids overcome the main 
limitations of their regular ancestors. The graphs used in the pyramid 
may be region adjacency graphs or dual graphs. This paper reviews the 
different hierarchical data structures and introduces a new representation 
named combinatorial pyramid. 



1 Introduction 

Regular image pyramids have been introduced in 1981/82 [10] as a stack of im- 
ages with decreasing resolutions. Such pyramids present interesting properties 
for image processing and analysis such as [4]: The reduction of noise, the process- 
ing of local and global features within the same frame, the use of local processes 
to detect global features at low resolution and the efficiency of many computa- 
tions on this structure. Regular pyramids are usually made of a very limited set 
of levels (typically log(n) where n is the diameter of the input image). Therefore, 
if each level is constructed in parallel from the level below, the whole pyramid 
may be built in C>(log(n)) steps. Since 1981, regular pyramids have been widely 
used in image segmentation, shape analysis, surface reconstruction and motion 
analysis (see e.g. [25] for a survey of these applications). However, the rigid- 
ity of regular pyramids induces several drawbacks such as the shift-dependence 
problem and the limited number of regions encoded at a given level of the pyra- 
mid [4]. Irregular pyramids overcome these negative properties while keeping 
the main advantages of their regular ancestors [23]. These pyramids are defined 
as a stack of successively reduced graphs. Each graph is built from the graph 
below by selecting a set of vertices named surviving vertices and mapping each 
non surviving vertex to a surviving one [23]. Typically, such graphs represent 
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the pixel neighborhood, the region adjacency, or the semantical context of image 
objects. Since images are samplings of a plane, the graphs are embedded in the 
plane by definition. Hence they are planar and the dual graph is well defined. 

Two kinds of graphs are usually used within the pyramid framework: the 
simple graph data structure encodes each relation between two vertices by a 
single edge. The dual graph data structure is based on an explicit encoding of 
both the initial graph and its dual. 

Combinatorial map first introduced in 1960 by Edmonds [12] may be consid- 
ered as a planar graph with an explicit encoding of the neighborhood’s orienta- 
tion of each vertex. This graph encoding induces several advantages besides the 
other graph representations such as the explicit encoding of the orientation of 
the plane or an implicit encoding of the dual graph. 

This paper presents the usual hierarchical data structures in Section 2. Then 
the combinatorial maps are presented together with their main properties in 
Section 3. A combination of the combinatorial and Irregular Pyramid frameworks 
is then presented in Section 4. 



2 Hierarchical Data Structures 

2.1 Regular Pyramids 

A regular pyramid is defined as a sequence of images with exponentially reduced 
resolution. Each image of this sequence is called a level of the pyramid. The low- 
est level corresponds to the original image while the highest one corresponds to 
the weighed average of the original image. Using the neighborhood relationships 
defined on each image the Reduction window relates each pixel of the pyramid 
with a set of pixels defined in the level below. The pixels belonging to a reduc- 
tion window are the children of the pixel which defines it. Such a pixel is the 
father (filled circles in Fig. 1) of the pixels belonging to its reduction window. 
This father-child relationship maybe extended by transitivity to any level of the 
pyramid. The set of children of one pixel in the base level image is named its 
receptive field. Within the regular pyramid framework the shape and the cardinal 
of the reduction window remains the same for all pixels of the pyramid. 

Another parameter of a regular pyramid is its Reduction factor which en- 
codes the ratio between the size of two successive images in the pyramid. This 
ratio remains constant between any two levels of a regular pyramid. A reduction 
function computes the contents of a father from the contents of the pixels in 
the reduction window. A regular pyramid is thus formally defined by the ratio 
N X N/q, where N x N denotes the size of the reduction window while q denotes 
the reduction factor. Different types of pyramids may be distinguished according 
to the ratio between the size of the reduction window and the reduction factor: 

— If iV X N/q < 1, the pyramid is named a non-overlapping holed pyramid. 

Within such pyramid, some pixels have no fathers [15] (e.g., the center pixel 

in Fig. 2(a)). 
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Fig. 1. A 2 X 2/4 regular pyramid 



□ 



— If X N/q = 1, the pyramid is called a non overlapping pyramid without 
hole{see e.g. in Fig. 2(b)). Within such pyramids, each pixel in the reduction 
window has exactly one father [15]. 

— If X N/q > 1, the pyramid is named an Overlapping pyramid (see e.g. in 
Fig. 2(c)). Each pixel of such a pyramid has several potential parents [10]. 
If each child selects one parent, the set of children in the reduction window 
of each parent is re-arranged. Consequently the receptive field may take any 
form inside the original receptive field. 



2.2 Irregular Pyramids 

Beside their interesting properties mentioned in Section 1, regular pyramids have 
several drawbacks. Indeed, the fixed size and shape of the reduction window 
together with the fixed value of the reduction factor induce a poor ability of 
regular pyramids to adapt their structure to the data. The rigidity of regular 
pyramids induces several drawbacks such as the shift-dependence problem and 
the limited number of regions encoded at a given level of the pyramid [4] . 




a) 2 X 2/5 




Fig. 2. Three different types of regular pyramids 
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Irregular pyramids overcome these negative properties while keeping the main 
advantages of their regular ancestors [23] . These pyramids are defined as a stack 
of successively reduced graphs (Go,Gi,G 2 ,G 3 in Fig. 3). Each graph is built 
from the graph below by selecting a set of vertices named surviving vertices 
and mapping each non-surviving vertex to a surviving one [23]. Therefore each 
non-surviving vertex is the child of a surviving one which represents all the 
non surviving vertices mapped to it and becomes their father. If the initial 
graph Go = (Vq; is defined from the regular grid, we may associate one pixel 
to each vertex (in Vq) £md link vertices by edges in Eq according to the adjacency 
relationships defined on the regular grid(Go in Fig. 3). 

The reduction operation was first introduced by Meer [22,23] as a stochastic 
process. Using such framework, the graph G;+i = (Vj-i-i, -Uj+i) defined at level 
Z -I- 1 is deduced from the graph defined at level I by the following steps: 

1. The selection of the vertices of G;+i among Vi. These vertices are the sur- 
viving vertices of the decimation process. 

2. A link of each non surviving vertex to a surviving one. This step defines a 
partition of Vi . 

3. A definition of the adjacency relationships between the vertices of G;+i in 
order to define E[+i. 

2.3 Selection of Surviving Vertices 

In order to obtain a fixed decimation ratio between each level, Meer [22] imposes 
the following constraints on the set of surviving vertices: 

\/v gVi - Vi+i 3v' e Vi+i : (u,U) € Ei (1) 

{v,v')^Ei ( 2 ) 

Constraint (1) insures that each non-surviving vertex is adjacent to at least 
a surviving one. Constraint (2) insures that two adjacent vertices cannot both 
survive. These constraints define a maximal independent set (MIS). 

Meer [22] defines a set of surviving vertices fulfilling the maximal independent 
set requirements thanks to an iterative stochastic process: The outcome of a 







Fig. 3. An irregular pyramid using the 4-adjacency of pixels 
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random variable uniformly distributed between [0, 1] is associated to each vertex. 
Then each vertex associated to a local maximum of the random variable survives. 
This process is iterated on the remaining non-surviving vertices until the set 
of surviving vertices fulfills the maximal independent set requirements. This 
purely random process has been adapted by Montanvert [23] to the connected 
component analysis scheme by restricting the decimation process to a set of 
subgraphs of the graph G[. Note that, using such restriction, the MIS is defined 
separately on each subgraph. Therefore, two surviving vertices may be adjacent 
in Gi if they do not belong to the same subgraph. The definition of the subgraphs 
is equivalent to the definition of a function A from Ei to {0, 1}. For each {v, v') S 
El, X{{v, v')) is set to 1 if u and v' belong to the same subgraph and 0 otherwise. 
Applied to the connected component analysis framework, the function A is simply 
defined by setting A((u, w')) to 1 if u and v' belong to the same component (each 
surrounded by a closed curve in Fig. 4). 

Applied to the segmentation scheme, the function A should be adapted at 
each level in order to take into account the greater variability of the input data. 
Jolion [14] improves the adaptability of the decimation process by using the 
local maxima of an interest operator instead of the local maxima of the random 
variable. For example, within the segmentation framework, Jolion defines the 
operator of interest as a decreasing function of the gray level variance computed 
in the neighborhood of each vertex. This operator provides a location of surviving 
vertices in homogeneous regions. 

2.4 Parent-Child Definition 

Given the set of surviving vertices, constraint (1) of a MIS insures that each non- 
surviving vertex is adjacent to at least one surviving vertex. Using the stochastic 
decimation process of Meer [22] and Montanvert [23] , each non surviving vertex is 
attached to its adjacent surviving neighbor with the greatest value of the random 
variable. Jolion [14] uses a contrast measure such as the gray level difference, to 
link each non-surviving vertex to its least contrasted surviving neighbor. Each 
surviving vertex is thus the father of all non-surviving vertices attached to it. The 
set of children of one father is called its reduction window by reference to Regular 




a) 4— connected 
sampling grid 




b) subgraphs 
defined by the 
function A 



Fig. 4. The connected components defined by function A 



Introduction to Combinatorial Pyramids 



113 





Fig. 5. Construction of a Maximal Independent Set using a random variable. 
The outcome of the random variable are displayed inside each vertex 



pyramid notations. Fig. 5 represents the outcomes of the random variable on a 
graph defined from the 4x4 8-connected sampling grid. The surviving vertices 
induced by the outcomes of this random variable are represented by an extra 
circle. The boundary of each reduction window is superimposed to the figure. 

2.5 Connecting Surviving Vertices 

The last step of the decimation process consists in connecting surviving vertices 
in Gi+\. Meer [22] joins two fathers by an edge if they have adjacent children 
(e.g. vertices labeled 15 and 7 in Fig. 6(a)). Since each child is attached directly 
to its father, two adjacent fathers at the reduced level are connected in the level 
below by paths of length less than three. Let us call these paths connecting 
paths. Using Constraint (2) of a MIS, two surviving vertices cannot be adjacent 
in Gi- Therefore, the length of the connecting paths is between 2 and 3. Using 
the restriction of the decimation process to subgraphs [23,14] defined by the 
function A, two surviving vertices may be adjacent by an edge mapped to 0 by 
A. In such cases, the length of the connecting path is equal to 1. 

2.6 Reaching the Top of the Pyramid 

The irregular pyramid is thus built recursively from Gq (the original sampling 
grid) to the apex G*. The apex of the pyramid cannot be reduced by the decima- 
tion process. This case occurs when Gt is reduced to a single vertex or when no 





Fig. 6. The reduced graph deduced from the maximal independent set (a) and 
the Parent-Children relationships (b) 
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decimation rule may be applied on the graph. For example, within the connect- 
ing component analysis scheme, the pyramid may be reduced until all connected 
components of the initial graph are reduced to a single vertex [23]. Within the 
segmentation framework, the decimation process may also stop when the dif- 
ference between the value of any adjacent vertices exceeds a threshold. Another 
strategy consists in decimating the initial graph until it reduces to a single vertex 
and to select a set of roots within the pyramid. Jolion [14] defines a root as a 
vertex whose contrast measure with its parent exceeds a threshold. This strategy 
is also used within the regular pyramid framework [10] and allows us to select 
roots at different levels of the pyramid. 

2.7 The Need for Multi-edges and Self-Loops 

The stochastic pyramids used by Meer [22] and Montanvert [23] or the adaptive 
pyramids defined by Jolion [14] combine the advantages of their regular ancestors 
with a greater adaptability to the input data. However, both the stochastic and 
adaptive reduction process utilize simple graphs e.g. graphs without multiple 
edges and self-loops. Therefore, if the graphs of the pyramid are used to encode 
a partition, an edge between two vertices in a simple graph pyramid encodes 
disconnected boundaries between the associated regions. Moreover, the lack of 
self-loops does not allow to differentiate inclusions from adjacencies relationships. 

Fig. 7 illustrates these inadequacies. The left and right regions of the left 
image share two distinct boundaries. However, a reduction process such as the 
one defined in the stochastic pyramid framework provides a final graph encoding 
these multiple boundaries by a single edge. 

This last drawback is induced by the definition of the edges between the 
surviving vertices. Indeed, two surviving vertices having several adjacent children 
will be connected by a single edge. A better understanding of this drawback may 
be achieved by interpreting the decimation process in terms of edge contractions 
and edge removals. The contraction of one edge {v,v') consists to identify v 
and v' into a new vertex v" and to remove the edge (u, v'). Note that, if v and v' 
are connected by an other edge, this last one becomes a self-loop doubly incident 
to v" . Since the contraction operation first identifies two distinct vertices, the 




Fig. 7. Inadequacy of the stochastic or adaptive decimation process 
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contraction of self-loops is not defined. The removal of one edge consists to 
remove it from the set of edges of the graph. 

Using the edge contractions and removal operations the decimation process 
performed by Meer [22], Montanvert [23] and Jolion [14] is equivalent to the 
application of the following steps: 

1 . The selection of a set S of surviving vertices. 

2. The definition of a set of edges N linking each non-surviving vertex to its 
father. 

3. The contraction of the set of edges N. 

4. The removal of all multiple edges and self-loops. 

The definitions of the sets S and N vary according to the considered method. 
However, within the stochastic [22,23] or adaptive [14] framework, the step 4 
of the decimation process removes all multiple edges and self-loops between 
surviving vertices. 

2.8 Contraction Kernels 

Kropatsch [15] encodes the step 3 of the decimation process using a Contraction 
Kernel. A Contraction Kernel is defined on a graph G = (V, A) by a set of 
surviving vertices S', and a set of non surviving edges N (represented by bold 
lines in Fig. 8) such that: 

— (U, N) is a spanning forest of G, 

— Each tree of {V,N) is rooted by a vertex of S. 

Since the set of non surviving edges N forms a forest of the initial graph, 
no self loop may be contracted and the contraction operation is well defined. If 
the graph is deduced from the initial sampling grid, each vertex on the border 
of this grid has to be adjacent to a special vertex encoding the background 
of the grid and called the background vertex. In order to preserve the image 
boundary, any edge encoding an adjacency relationship between a vertex of the 
graph and the background vertex must be excluded from N. Therefore, an 




Fig. 8. A contraction Kernel {S, N) composed of three trees. Vertices belonging 
to S are represented with a filled circle inside (•) 
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initial graph Go deduced from the sampling grid, may at most be reduced up 
to a graph G* defined by two vertices connected by a single edge. If each vertex 
of Go is associated to one pixel of the sampling grid, the two vertices encode 
respectively the image and its background. The edge between the two vertices 
corresponds to the image boundary. 

The decimation of a graph by Contraction Kernels differs from the Stochas- 
tic [22,23] or Adaptive [14] decimation processes on the two following points: 

— First, using Contraction Kernels, the set of surviving vertices is not required 
to form a MIS. Therefore, two surviving vertices may be adjacent in the 
contracted graph. This last property avoids the use of the function A defined 
by Montanvert [23] and Jolion [14]. 

— Secondly, using Contraction Kernels a non surviving vertex is not required 
to be directly linked to its father but may be connected to it by a branch of 
a tree (see Fig 8). The set of children of a surviving vertex may thus vary 
from a single vertex to a tree with any height. 

Fig. 9 illustrates the decimation performed by the Contraction Kernel dis- 
played in Fig. 8 by contracting successively each tree of the forest {V, N). Note 
that all the contractions may be performed in parallel. 

The contraction of a graph reduces the number of vertices while maintaining 
the connections to other vertices. As a consequence, the decimation of a graph 
by a Contraction Kernel may induce the creation of some redundant edges. The 
characterization of these edges requires a better description of the topological 
relationships between the objects described by the graph. Since images are often 
projections of the reality into a plane, the image structure is planar and can 
be described by planar graphs. Moreover, if the initial graph is deduced from 
an initially planar grid (like 4-neighborhood) the initial graph and its reduced 
versions are planar. Given an initial planar graph G, the vertices of its dual G 
are located inside every face of G. The edges of G connect those dual vertices of 
which the corresponding faces are adjacent (lines between boxes in Fig. 10(a)). 
Note that the construction of the dual graph induces a one to one mapping 
between the edges of both graphs. Therefore, if N denotes a set of edges of 






Fig. 9. The successive contractions of the trees defined in Fig. 8 
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a) (Go, Go) b)(Gi,Gi) 

Fig. 10. The initial planar grid with the associated dual gaph(a), and charac- 
terization of redundant edges (b). Dual vertices incident to redundant dual edges 
are represented by filled boxes 



the initial graph, we can denote by N the set of associated edges in the dual 
graph. A well known result of graph theory [13] states that if G' is obtained from 
G = (y, E) by contracting a set N C E, the dual of G' may be obtained from 
G by removing N in G. Conversely, if G' is obtained from G by removing a set 
N C E, G' is obtained from G by contracting the edges N in G. 

The characterization of redundant edges using the dual of the contracted 
graph is illustrated in Fig 10. If the initial graph is deduced from the initial 
4-connected sampling grid, each dual vertex encodes a corner of a pixel and each 
edge of the dual graph encodes a side of a pixel. The upper filled box of Fig 10(b) 
represents a dual vertex with a degree equal to two. The two vertices defining 
the face of this dual vertex correspond to two regions of the initial image sharing 
a boundary artificially split in two subpaths by the dual vertex. The contraction 
of any of the two dual edges incident to this dual vertex concatenates the two 
paths. The contraction of this dual edge has to be followed by the removal of 
the associated edge in the initial graph to maintain the duality between both 
graphs. In the same way, the lower filled box of Fig 10(b) represents a dual vertex 
with a degree one. The empty self-loop associated to this dual vertex encodes 
an adjacency relationship between two vertices contracted in a same vertex by 
the Contraction Kernel. This useless^ adjacency relation is deleted by removing 
the self-loop and contracting the associated dual edge. 

The level generation from a pair of dual graphs (Gq, Gq) using a Contraction 
Kernel {S, N) is thus performed in two steps respectively illustrated in Fig. 11(b) 
and Fig. 11(c): 

1. A set of edge contractions on Go encoded by the Contraction Kernel {S, N). 
The dual of the contracted graph Gi is computed from Gq by removing the 
dual of the edges contained in N. 

2. The removal of redundant edges encoded by a Contraction Kernel applied 
on the dual graph. Let us call this kernel a Removal Kernel. The edge con- 

A self-loop is ’useless’ if it does not surround any surviving part of the graph 



1 
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a) (Go, Go) b)(G3,G3) c)(G4,G4) 

Fig. 11. Two steps of the graph reduction operation encoded by Contraction 
Kernels 



tractions performed in the dual graph has to be followed by edge removals in 

the initial one in order to preserve the duality between the reduced graphs. 

If each vertex of the original graph Go encodes a pixel of the sampling grid, 
each vertex of the contracted graphs encodes a connected region of this grid. 
The preservation of the meaningful edges by the two step strategy defined by 
Kropatsch induces a one to one mapping between the edges of the graph and 
the region boundaries. Experiences with connected component analysis [18], with 
universal segmentation [16], and with topological analysis of line drawings [17] 
show the great potential of this concept. 

3 Combinatorial Maps 

Combinatorial maps and generalized combinatorial maps define a general fram- 
work which allows to encode any subdivision of nD topological spaces orientable 
or non-orientable with or without boundaries. The concept of maps has been 
first introduced by Edmonds [12] in 1960 and later extended by several authors 
such as Lienhardt [20], Tutte [26] and Cori [11]. This model has been applied to 
several fields of computer imagery such as geometrical modeling [3] , 2D segmen- 
tation [1,5] and graph labeling [21]. An exhaustive comparison of combinatorial 
maps with other boundary representations such as cell-tuples and quad-edges is 
presented in [20] . Recent trends in combinatorial maps apply this framework to 
the segmentation of 3D images [6,2] and the encoding of hierarchies [8,9]. 

The remaining of this paper will be based on 2D combinatorial maps which 
will be just called combinatorial maps. A Combinatorial map may be deduced 
from a planar graph by splitting each edge into two half edges called darts. The 
relation between two darts di and ^2 associated to the same edge is encoded by 
the permutation a which maps d\ to ^2 and vice-versa. The permutation a is 
thus an involution and its orbits are denoted by a* (d) for a given dart d. These 
orbits encode the edges of the graph. Moreover, each dart is associated to a 
unique vertex. The sequence of darts encountered when turning around a vertex 
is encoded by the permutation a. Using a counter-clockwise orientation, the 
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orbit <y*{d) encodes the set of darts encountered when turning counter-clockwise 
around the vertex encoded by the dart d. A combinatorial map can thus be 
formally defined by: 

Definition 1. Combinatorial Map 

A combinatorial map G is the triplet G = (X>, tr, a), where T> is the set of 
darts and a, a are two permutations defined on T> such that a is an involution: 

\/d £T> ao a{d) = d 

Note that, if the darts are encoded by positive and negative integers (as in 
Fig 12(a)), the involution a may be implicitly encoded by: 

Vd e X> a{d) = -d 

This convention is often used for practical implementations of combinatorial 
maps [5,7] where the permutation a is simply implemented by an array of integers 
(see Fig. 12 (d)). 

Using the combinatorial map formalism, the dual G of a combinatorial map 
G = (X>, CT, a) is defined as G = (X>, tp = aoa,a). The orbits of the permutation 
ip encode the set of darts encountered when turning counter-clockwise around 
the vertices of the dual graph (Fig 12(b)). Such orbits may also be interpreted 
as the sequence of darts encountered when turning around a face. Note that due 
to our orientation conventions, each dart of an orbit have its associated face on 
its right (e.g. the (/7-orbit (1,3,2) in Fig 12(a)). 

A combinatorial map and its dual being deduced one from the other by a 
basic transformation, many properties of one graph remain true for its dual. 
For example, if the initial combinatorial map is connected its dual is also con- 
nected [8]. Moreover, many particular configurations such as self- loops (see Ta- 
ble 1) remain particular configurations in the dual combinatorial map (see [8] 
for a demonstration of these results) . 

Contraction kernels described in Section 2.8 may induce the creation of some 
redundant edges. Using the dual graph framework, such redundant edges are 
characterized by their associated dual edges which are incident to dual vertices 
with a degree lower than 3. Using the combinatorial map framework one of the 
dart of a redundant edge a*{d) is either a self-direct-loop (see Table 1) or belongs 



Table 1. Relationships between particular configurations in the original and 
dual combinatorial maps 



(T>,a,a) 


{T>,p,a) 


self- loop a{d) G <r*{d) 

bridge a{d) € p*{d) 

self-direct-loop {a{d) = a{d) or a{a{d)) = d) 
pendant dart {a{d) = d or a{a{d)) = a{d)) 


bridge 
self-loop 
pendant dart 
self-direct-loop 
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a = (l,-l)(2,-2)(3,-3)(4,-4)(5,-5)(6,-6) 
]a = (-6,-5,-3,2)(3,4,6,-l)(5.-4)(l,-2) 

, (p = (1, 3. 2)(4, 5, -3)(-4, 6, -5)(-l. -2, -6) 



d 


-6 


-5 


-4 


-3 


-2 


-1 


1 


2 


3 


4 


5 


6 


a{d) 


-5 


-3 


5 


2 


1 


3 


-2 


-6 


4 


6 


-4 


-1 



Fig. 12. The combinatorial maps corresponding to G 4 and its dual. The lower 
part of this figure represents an encoding of the permutation a by an array of 
integers 



to an orbit of ip with a cardinal equal to 2. This last condition characterizes dual 
vertices with a degree 2 and may be checked by: ip^{d) = d or ip{a{d)) = a{d). 
Combinatorial maps have thus the following interesting properties: 

— The darts are ordered around each vertex and face. Note that, this informa- 
tion is not encoded by the simple graph data structure used by Meer [22], 
Montanvert [23] and Jolion [14] nor explicitely available in dual graph data 
structures. 

— The simplicity and the efficiency of the computation of the dual combinato- 
rial map avoids an explicit encoding of the dual graph. 

— The combinatorial map formalism may be extended to any dimensions [19]. 

4 Combinatorial Pyramids 

The aim of combinatorial pyramids is to combine the advantages of combinatorial 
maps with the reduction scheme defined by Kropatsch [15]. A combinatorial 
pyramid will thus be defined by an initial combinatorial map successively reduced 
by a sequence of contraction or removal kernels defined in the combinatorial 
map framework. The definition of combinatorial pyramids requires thus a formal 
definition of the contraction and removal operations in the combinatorial map 
framework. 

Using combinatorial maps, the removal operation (see Definition 2) is en- 
coded as an update of the permutation a. The analytic form of Definition 2 is 
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given by proposition 1 . Note that this last operation may be performed efficiently 
if fj is encoded by an array of integers. 

Definition 2. Removal Operation 

Given a combinatorial map G = (X>, cr, a) and a dart d € "D. If a*{d) is not 
a bridge, the combinatorial map G' = G \ a*{d) = (T>' ,a',a) is defined by: 

- V = V\a*{d) and 

— a' is defined as: 

y d G T>' <j'{d) = cr"((i) with n = Min{p G IN* / cr^(d) ^ a*(d)} 

Note that the bridges are excluded from Definition 2 in order to keep the number 
of connected components of the combinatorial map. Moreover, since both the 
dart d and its opposite a{d) are removed, the involution a of G' is restricted to 
the reduced dart set T>\a*{d). Although, the involution a of the reduced graph is 
mathematically different from the one of the original graph, the implicit encoding 
of the permutation a by the sign (see Section 3) may remain unchanged in both 
graphs. We thus consider that the involution a is unchanged by the removal 
operation. 

Proposition 1. Given a combinatorial map G = fiD,a,a) and a dart d G "D 
which is neither a bridge nor a self- direct-loop, the combinatorial map G\a*{d) = 
(X> \a*{d),a',a) is defined by: 

( yd' G T>\ a~^{a*{d)) <j'{d') = <j{d') 

\ (j'{(j-\a{d))) = cr(a(d)) 

As stated in Section 2.8, the removal of one edge on a planar graph induces 
the contraction of its associated edge in its dual. Conversely, the contraction 
of one edge on the initial graph is equivalent to the removal of the associated 
dual edge in the dual graph (Fig. 12). This last property allows us to define the 
contraction operation as a removal performed in the dual combinatorial map. 

Definition 3. Contraction Operation 

Given a combinatorial map G = {T>,a,a) and a dart d G T> which is not 
a self-loop. The contraction of dart d creates the combinatorial map G/a*{d) 
defined by: 

G/a*{d) = G\a*{d) 

Note that this operation is well defined since d is a self-loop in G iff it is a bridge 
in G. Thus, any sequence of removal or contraction operations will preserve the 
number of connected components of the initial graph. This last property is useful 
in the irregular pyramids framework because it attempts to simplify the initial 
planar map while preserving its essential topological properties. Moreover, since 
the involution a is not modified by the removal operation and remains the same 
for one combinatorial map and its dual (see Section 3) it is also unchanged by 
the contraction operation. 
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a) G4/a*(5) 



-2 -1 




Fig. 13. Contraction of the edge a* (5) from G 4 and the associated dual removal 



Since the contraction operation is equivalent to a removal operation per- 
formed on the dual combinatorial map, the analytic form of Definition 3 may be 
deduced from the analytic form of the removal operation by substituting by 
fj in Proposition 1. This analytical form, rewritten in terms of modifications of 
the permutation a is provided in Proposition 2. The proof of this result together 
with the description of the special cases excluded from Propositions 1 and 2 may 
be found in [8]. 

Proposition 2. Given a combinatorial map G = (X>, cr, a) and a dart d G "D 
which is neither a pendant edge nor a self loop. The combinatorial map 
G/a*{d) = (X> \ a', a) is defined by: 

( yd' GT> \ a~^{a*{d)) cr'{d) = a{d) 

I cr'(cr-l(d)) = cr(a(d)) 

[ a'(a~^(a(d))) = a(d) 

Note that, since the dual graph is implicitly encoded, any modification of the 
initial combinatorial map will also modify its dual. Therefore, using combinato- 
rial maps the dual combinatorial map is both implicitly encoded and updated. 
Moreover, the updates of the permutation a by the contraction operation pre- 
serves the orientation of the original combinatorial map. For example, let us 
suppose that the empty vertices displayed in Fig 14(a) represents two adjacent 
regions surrounded by the regions l,2,3,4,a,b,c,d. This last sequence encodes the 
order in which the adjacent regions will be encountered when turning counter- 
clockwise around the two adjacent regions. The order of this sequence is pre- 
served when the two regions are merged in Fig. 14(b), by the contraction of 
their common edge. 

Definitions 2 and 3 define the value of the permutation a after the removal 
or the contraction of one edge. Such definitions may be extended to the removal 
or the contraction of a set of edges [8]. However, in order to avoid testing the 
preconditions of these operations for each edge we have to adapt the concept of 
contraction kernel (see section 2.8) to the combinatorial map framework. 
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a) 



b) 



Fig. 14. Preservation of the orientation using combinatorial maps 



Definition 4. Contraction Kernel 

Given a connected combinatorial map G = {T>, a, a) and a non-empty set of 
darts B <zT> encoding the adjacency relationship between G and its background. 
The set K <ZT> will be called a contraction kernel iff: 

1. K is a forest of G, 

2. K does not intersect B. 

The set ST> = T> — K is called the set of surviving darts. 

Note that the set of surviving darts of a contraction kernel cannot be empty 
since it contains at least B. Unlike dual graph contraction kernels, contraction 
kernels defined within the combinatorial map framework do not specify the sur- 
viving vertices. Within such framework, the surviving vertices are implicitly 
defined by the orbits of the updated permutation a. 

Given an initial combinatorial map Go = {'D,a,a) and a contraction ker- 
nel K, the contracted combinatorial map G' = G/K = {S'D,(r\a) may be 
deduced from G by the iterative contraction of each dart contained in K us- 
ing proposition 2. However, this last solution does not allow a straightforward 
parallel implementation and induces unnecessary updates of the permutation a. 
The parallel computation of the contracted combinatorial map is based on the 
concept of connecting walk: 

Definition 5. Connecting Walk 

Given a combinatorial map G = (fD,a,a), a contraction kernel K and a dart 
d € ST>, the connecting walk associated to d is equal to: 

CW{d) = d, ip{d), ..., ip'^-^id) with n = Min{p e IN* | <pP{d) G SV} 

Note that, only the first dart of each connecting walk survives. Each con- 
necting walk CW{d) connects the surviving darts d and g:'^{d) by a sequence of 
non surviving darts. 
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Connecting walks may be considered as an extension of the connecting paths 
used by Jolion [14] and Montanvert [23] (see Section 2.5). However, using contrac- 
tion kernels, the MIS requirements are substituted by the more general concept 
of forest requirement. Therefore, the paths defined by Montanvert and Jolion 
have to be replaced by the more general concept of walks [13]. Moreover, since 
the trees of the forest may have any depth, the length of a connecting walk is 
not bounded by 3. Finally, using combinatorial maps, a connecting walk is an 
ordered sequence of darts which preserves the order defined on X> by the per- 
mutation ip. More precisely, given a surviving dart d and its connecting walk we 
have [9]: 

Vd e SV ip'{d) = (3) 

Where n = Min{p e IN* j ip^{d) G ST>} and (p' = a' o a denotes the ip permuta- 
tion of the contracted graph G / K . The permutation a being an involution, the 
permutation a' may be retrieved from ip' by a' = ip' o a. We have thus: 

Vd S SV O'' {d) = (/?"(a(d)) with n = Min{p € IN* j ip^{a{d)) G SV} 

Note that, </j"“^(a(d)) is the last dart of CW{a{d)). Therefore, the per- 
mutation a remains unchanged in the contracted combinatorial map while the 
permutation o' maps each surviving dart d to the (^-successor of the last dart 
of CW{a{d)). Therefore, the computation of the cr-successor of a dart d in the 
contracted combinatorial map requires to traverse CW{a{d)). Sequential algo- 
rithm 1 computes the cr-successor of all the surviving darts in the contracted 
combinatorial map. Since the set of connecting walks forms a partition of V (see 
proof in [9]), Algorithm 1 has to traverse \T>\ darts. Its complexity is thus equal 
toO{\V\). 



1 dart contracted_sigma(G = (■!?, cr, a), A) 

2 { 

3 For each d G SV = "D — K 

4 do 

5 d' = (p{a{d)) = a[d) 1 1 Second dart of CW {a{d)) 

6 

7 while(d^€A) // computation of CW {a{d)) 

8 d' = ip(d') 

9 o'{d) = d' 

10 done 

11 } 



Algorithm 1: Computation of the permutation a' of the contracted combinatorial 
map C' = {SV, a' , a) 



Using the dual graph pyramid scheme, a removal kernel is defined as a con- 
traction kernel applied on the dual combinatorial map. Therefore, all results 
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already obtained on contraction kernels remain valid in the dual combinatorial 
map for removal kernels [9]. The connecting walks defined by removal kernels 
may be constructed as a sequence of cr-successors of a surviving dart: 

yd € SV CWnid) = d.a{d) . . . cr””^(d) with n = Min{p e IN* | aP{d) e SV}. 

The cr-successor of any surviving dart d in the removed combinatorial map 
is then defined as cr"((i) where cr"“^(d) is the last dart of CWn{d). The updated 
permutation a' may thus be computed from the original combinatorial map by 
a traversal of the connecting walks defined by the removal kernel. This traversal 
may be performed by a slightly modified version of Algorithm 1 where line 8 is 
replaced by the instruction d' = a{d). 

Using the same scheme as dual graph pyramids (see section 2.8), the level 
generation using a contraction kernel K is performed in two steps: 

1. One set of edge contractions encoded by a contraction kernel Ki. 

2. The removal of redundant edges encoded by a removal kernel K 2 - Such re- 
dundant edges are incident to faces with a degree one or two (see Section 2.8 
and 3). 

5 Conclusion 

In this paper we have described different hierarchical models and some of the 
basic processes working on them: regular, irregular and combinatorial pyramids. 
All three possess the following important well known properties [4] : 

— Independence to resolution, 

— global to local interactions, 

— log(imagejliameter) parallel complexity. 

In the remaining of this section, we want to differentiate the advantages and 
drawbacks of the three types of pyramids discussed in the paper. The first part 
of Table 2 describes the properties of the basic entities represented at each level 
of the structure. In the second part, we consider the properties of the projection 
of these basic entities into the image plan. 

Each level of the hierarchy encodes a set of resolution cells (“region”) and 
their respective adjacencies. A resolution cell is explicitely encoded by a pixel 
or a vertex of the graph structure while it is implicitely coded by a cr-orbit 
of the combinatorial map. An adjacency relation between two resolution cells 
is implicitely encoded by a crack between two 4-adjacent pixels while it is ex- 
plicitely coded as an edge of the graph or a couple of darts (d, a{d)) in the other 
hierarchies. A point is defined where three or more resolution cells meet. It is 
implicitely defined as a pixel corner in the regular structure and as a :/?-orbit in 
the combinatorial map while it is not encoded by the simple graph pyramids and 
explicitely encoded as a dual vertex in the dual graph. A combinatorial map is 
the only structure which explicitely codes the orientation of the resolution cells 
local arrangements by the permutation a. 
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Table 2. Basic hierarchical entities and properties of their image embedding 



Pyramids 


Regular 


Irreg 

Simple graph 


mlar 

Dual graph 


Combinatorial 


Region 


pixel 


vertex 


vertex 


(T-orbit 


Adjacency 


crack 


edge 


edge 


{d,a{d)) 


Point 


pixel corner 


- 


dual vertex 


yi-orbit 


orientation 


- 


- 


implicit 


a, ip 


Properties of image embedding 


Receptive held 


regular shape 


arbitrary shape 


arbitrary shape 


arbitrary shape 


Segmentation 

scheme 


linking 


adaptive deci- 
mation 


adaptive dual 
contraction 


adaptive dual 
contraction 


Region(vertex) 


not connected 


connected 


connected 


connected 


Boundaries 

(edge) 


not connected 


not connected 


connected 


connected 



The receptive field of a resolution cell defines its projection into the image 
plane. The regular pyramid is the only structure which restricts the form of 
the receptive fields to regular shapes. Using hierarchies, the segmentation of the 
image plane is achieved by selecting a set of roots(Section 2.6). The receptive 
field of each root defines a region of the image plane. The pyramid linking [10] 
algorithm adapts the shape of the regular pyramid receptive fields to the image 
content. All other hierarchies may use the receptive fields obtained from the 
pyramid construction scheme. Moreover, the pyramid linking algorithm is the 
only segmentation scheme which may produce non connected regions (e.g. [24]). 
A boundary is defined as the image embedding of an adjacency relation. This 
boundary may be not connected using regular or simple graph hierarchies. 

Dual graph and Combinatorial pyramids are thus the only hierarchies which 
provide both connected regions and connected boundaries. Combinatorial pyra- 
mids combine the advantages of dual graph pyramids with an explicit orientation 
of the boundary segments of the embedded object thanks to the permutation 
fj. Moreover, using combinatorial maps, the dual graph is both implicitely en- 
coded and updated. Finally, the combinatorial map formalism is defined in any 
dimensions. 
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Abstract. In this paper, we consider the problem of representing a mul- 
tiresolution geometric model, called a Simplicial Multi-Complex (SMC), 
in a compact way. We present encoding schemes for both two- and three- 
dimensional SMCs built through a vertex insertion (removal) simplifica- 
tion strategy. We show that a good compression ratio is achieved not only 
with respect to a general-purpose data structure for a SMC, but also with 
respect to just encoding the complex at the maximum resolution. 



1 Introduction 

Simplicial complexes (triangle and tetrahedral meshes) are popular representa- 
tions for surfaces, objects, scalar and vector fields in applications, such as Geo- 
graphic Information Systems (GISs), computer graphics, virtual reality, medical 
imaging, finite element analysis. The availability in the applications of very large 
meshes has led to an intense research activity on mesh simplification and on mul- 
tiresolution models. 

Data simplification is a tool for reducing a mesh to a manageable size: it 
reduces the resolution of a mesh (i.e., the number and the density of its cells), 
while loosing accuracy. In many cases, however, simplification needs to be applied 
selectively, i.e., only in those portions of a mesh that are less relevant for the 
current application. Since the regions of interest for an application may vary 
dynamically, on-line simplification would be important. Mesh simplification, on 
the other hand, is an expensive task, which cannot be performed on-line on 
meshes of large size. 

The basic idea underlying multiresolution modeling consists of de-coupling 
the simplification phase (performed off-line) from the selective refinement phase 
(performed on-line). A multiresolution model basically encodes the steps per- 
formed by a simplification process as a partial order, from which a virtually 
continuous set of meshes at different Levels Of Detail (LODs) can be extracted. 
In [-5] , we have described a general multiresolution model, called a Multi-Gomplex 
(MG), which consists of directed acyclic graph representing a partially ordered 
set of mesh updates. The Multi-Gomplex is dimension-independent as well as 
independent of the specific simplification strategy used for its construction. On 
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the other hand, the multiresolution approach tends to introduce an overhead 
due to the need of encoding several complexes at various LODs as well as the 
partial order. 

The fundamental operation on a multiresolution model is the extraction of 
meshes at variable resolution, which satisfy user-defined criteria, and have a 
minimum size. Such operation is known as selective refinement [9]. Most common 
application-dependent queries on two- and three-dimensional meshes reduce to 
instances of selective refinement, since they need a high resolution just in a 
specific part of the mesh. Some examples are contour, or isosurface extraction, 
windowing, intersection with a plane, or with a polygonal line [4]. 

In this paper, we consider a specialization of the MC to simplicial complexes, 
that we call a Simplicial Multi-Complex (SMC). We describe first a general data 
structure for SMCs which is independent of the dimension and of the construc- 
tion technique. However, such structure has a large overhead since it is about 
three times larger than a standard data structure representing the simplicial 
complex available from the SMC the maximum resolution. 

Then, we present compact data structures for a specific class of SMCs in 
two and three-dimensions, constructed on the basis of a vertex-insertion, or 
vertex-removal simplification strategy. We show that such structures provide 
good compression ratios not only with respect to the general SMC data structure, 
but even with respect to encoding just the original mesh at full resolution. All 
data structures are analyzed with respect to both their space requirements and 
their efficiency in supporting the primitives needed for implementing selective 
refinement. 

The rest of the paper is organized as follows. Section 2 introduces multireso- 
lution modeling and the vertex-based SMC. Section 3 is concerned with general 
issues about encoding multiresolution models, and includes a brief overview of re- 
lated work. Section 4 describes the selective refinement query. Section 5 presents 
a general-purpose data structure for the SMC. Section 6 describes and analyzes 
compact ways for encoding vertex removals in both two and three dimensions. 
Section 7 presents comparisons among the various data structures. Finally, Sec- 
tion 8 contains some concluding remarks. 



2 Multiresolution Modeling and the Simplicial 
Multi-complex 

We denote with k and d two natural numbers such that k < d. A k-dimensional 
simplex a is the locus of points that can be expressed as the convex combination 
of fc -|- 1 affinely independent points in 1E‘^, called the vertices of cr. Any simplex 
having its vertices at a subset of the vertices of a is called a facet of cr. 

A simplicial complex in is a finite set E of simplices of heterogeneous 
dimension such that: for any simplex cr S A, all facets of cr are in A; and, for 
any pair of simplices cri , ct 2 S A, their intersection is either empty or a common 
facet of iTi and ct 2 . 
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A fc-dimensional simplicial complex (for short, a fc-complex) is regular ii and 
only if every simplex is a facet of some fc-simplex. A regular fc-complex is com- 
pletely defined by its fc-simplices. Here, we will deal only with regular complexes 
whose domain is a manifold. Therefore, we will identify a fc-complex with the set 
of its fc-simplices (hereafter called just simplices, whenever no ambiguity arises) . 
The manifold assumption is consistent with the modification operator we adopt 
to generate multiple levels of details (i.e., vertex insertion / removal, which will 
be defined later), since such operator preserves the manifold property of a mesh. 

The general idea behind data simplification techniques is to perform a se- 
quence of atomic modifications on a given mesh by either removing details from 
a mesh at high resolution, or adding details to a coarse mesh. Specific simpli- 
fication algorithms are characterized by using a certain type of modification 
operators to act on the mesh. The most popular ones are vertex insertion, vertex 
removal, and edge collapse. 

A survey on data simplification for triangle meshes can be found in [9] . Data 
simplification algorithms for tetrahedral meshes have been proposed more re- 
cently in the literature, and they are based on either edge collapse [1,12], or on 
vertex insertion [2,20]. Algorithms for vertex insertion (and removal) on tetra- 
hedral meshes have been proposed in [21]. 

Multiresolution models of meshes are built on top of a data simplification 
process. They encode the range of Levels of Detail (LODs) generated during 
the simplification, and arrange them by defining suitable dependency relations, 
which are used to guide the extraction of specific LODs. 

Several two-dimensional multiresolution models have been proposed in the 
literature for terrains in Geographic Information Systems (GISs), and for sur- 
faces in computer graphics and virtual reality. They can be either based on nested 
regular meshes [6,7,17], or on irregular triangular meshes [4,8,13,16]. A fewer 
number of three-dimensional multiresolution representations have also been pro- 
posed, mainly for describing scalar fields in scientific visualization. Again, they 
can be classified into models based on regular nested meshes [10,15,18,23], and 
models based on irregular simplicial complexes [19]. In [5], we have shown that 
all proposed models are special cases of a Multi-Gomplex (MG), a general and 
dimension-independent framework for multiresolution modeling based on cell 
complexes. 

Here, we consider the Simplicial Multi-Complex (SMG), which specializes the 
MG defined in [5] to simplicial cells. The SMG is a general multiresolution model 
for simplicial complexes that is completely independent on the dimension of the 
complex, on the dimension of the embedding space, and on the strategy used to 
obtain the various LODs included in the model. 

The definition of a SMG is based on the concept of a local update: an operator 
that, when repeatedly applied to a complex, monotonically increases or decreases 
its resolution (i.e., the number and density of its simplices). In this paper, we 
will consider local updates which refine a complex through the insertion of a new 
vertex, or coarsen it through the removal of an existing vertex. 
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Any sequence of updates starts from an initial complex and terminates with 
a final complex which represent the two extrema of the Level Of Details (LODs) 
spanned by the update process. We call reference complex the complex at maxi- 
mum resolution, and base complex the one at minimum resolution. Any coarsen- 
ing sequence can be reversed into a refinement one, and vice versa. We consider 
a local update as an invertible operation that either refines or coarsens a com- 
plex locally by replacing a connected subset of its simplices with another set 
of simplices having a higher or a lower resolution, respectively, in such a way 
that the result is still a regular and manifold complex (see [5] for conditions to 
guarantee these properties). An update is denoted as a pair u = (u“, m^), where 
u~ represents the coarsening operation, and represents its inverse refinement 
operation. Alternatively, u~ and u~^ can be seen as sets of simplices: u~ is the 
set of simplices that disappear from the mesh when refining it, and appear when 
coarsening it, and the opposite holds for 

In case of vertex-based updates (vertex removal and vertex insertion), u~ is 
a simplicial complex filling a fc-dimensional polytope without internal vertices, 
and u~^ is a set of fc-dimensional simplices all incident in the new vertex Vu ■ The 
boundaries of u~ and are the same. 

A SMC abstracts from the totally ordered sequence of updates performed 
during a mesh simplification process, and defines the partial order describing the 
mutual dependencies between pairs of updates. For the purpose of defining the 
partial order, we look at a simplification process in the coarse-to-fine direction. 
An update U 2 of a given sequence is said to depend on another update u\ if and 
only if ui~^ H U 2 ~ yf 0, i.e., if U 2 ^ remove some simplex created by ui + . 

A SMC encodes a partial order describing the mutual dependencies between 
updates as a Directed Acyclic Graph (DAG). In a SMC M. = {U,A), nodes 
in U represent updates, while arcs in A represent dependency relations between 
updates: an arc <ui,U 2 > is in A if and only if update U 2 depends on ui. The 
creation of the base complex is also represented as a special node, which is the 
unique root of the DAG. Figure 1 shows an example of a two-dimensional SMC. 

We recall that the construction of a vertex-based SMC relies on an algorithm 
for data simplification based on vertex insertion or removal. Several simplification 
algorithms have been proposed in the literature both in two dimensions (for 
triangle meshes representing either scalar fields or free-form surfaces, see [9] for 
a survey), and in three dimensions (for tetrahedral meshes, see [1,12,21]). The 
various existing proposals differ in the strategy they use to drive simplification, 
involving issues such as bounding the geometric error, preserving a good aspect 
ratio of simplices, etc. However, such choices do not affect the possibility of 
building a vertex-based SMC, nor the design of data structures to encode it. 



3 Encoding Multiresolution Models 

The effectiveness of a multiresolution model is highly dependent on its stor- 
age requirements. A multiresolution data structure is characterized by the way 
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Fig. 1. (a) A sequence of vertex insertions refining a two-dimensional simplicial 
complex, where the mesh portion affected by each update is highlighted, (b) The 
corresponding SMC: the dashed line encloses a consistent subset S of nodes; the 
triangle mesh Ss is shown in the circle on the right 



in which it encodes the individual mesh updates and the dependency relations 
among updates. 

A data structure for a two-dimensional SMC, described in [4], adopts an 
explicit representation of both the updates (encoded by listing all involved tri- 
angles) and the dependency links (encoded as a DAG). Such structure introduces 
an overhead which can be too high to be used for representing very large data 
sets. 

More compact data structures are designed for certain subclasses of SMCs, 
i.e, SMCs built by using vertex insertion / removal, or edge collapse, by exploiting 
the topological and geometric nature of specific simplification operators. This 
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allows defining compact encodings for the updates and sometimes for the partial 
order. 

In the two-dimensional case, compact encodings of edge collapses has been 
first proposed by Hoppe [13]. A very compact way to represent dependency links 
has been proposed by El-Sana and Varshney for the special case of general edge 
collapse [8] . Other compact structures for dependencies have been defined based 
on binary trees of vertices [13,16], but they fail to represent the natural notion 
of dependency (see [5] for a discussion). 

A compact encoding of a generalized edge collapse has been proposed by 
Popovic and Hoppe in [19]. This method has been developed for a quite general 
situation, in which the update can affect the topology, as well as the regularity of 
the mesh. A very compact data structure for encoding three-dimensional SMCs 
built based on edge collapse has been proposed in [3]. 

Much less work has been performed for encoding SMCs built based on vertex 
insertion and vertex removal. In two dimensions, a method has been proposed 
by Klein and Gumhold [14]. Based the observation that representing a vertex- 
removal reduces to describing the way in which the polygonal hole left by the 
removed vertex must be triangulated, and that such polygon has a bounded 
number of vertices, they encode all equivalence classes of the possible triangula- 
tions of such polygons, and, for each removed vertex, they store a reference to 
the appropriate equivalence class. In [14], a compact way to encode dependen- 
cies is also described, which will be recalled in Section 5.1. To our knowledge, 
no proposal exists for the three-dimensional case. 

In this paper, we give our proposal of compact data structures for two- and 
three-dimensional SMCs. 

We denote with n, s, m and a the number of vertices, simplices, nodes (up- 
dates) and arcs (dependency links) in a SMC M.. We denote with n and s the 
number of vertices and simplices in the reference complex of M . , respectively (see 
Table 1). We also assume that the base complex has a very small, and, therefore, 
negligible number of vertices and simplices. 

From Euler formula in two dimensions we have that s ~ 2n. In three dimen- 
sions, s is 0{n^) in the worst case, but it has experimentally been found that, 
for real datasets, s « 6n [11]. 



Table 1. Parameters expressing the size of a SMC 



Symbol 


meaning 


context 


n 


^ vertices 


whole SMC 


s 


# simplices 


whole SMC 


m 


# nodes 


whole SMC 


a 


^ arcs 


whole SMC 


n 


^ vertices 


reference complex 


s 


# simplices 


reference complex 


p 


#u-,#n+ <P 


SMC 
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Table 2. Relations that are assumed among the parameters of Table 1, in the 
two- and in the three-dimensional case 



Relation 


restrictions 


justification 


s ~ 2n 


2D 


provable fact 


s « 6n 


3D 


experimentals 


n = m = n 


vertex-based 


provable fact 


< 16 


2D, vertex-based 


construction 


#u~ , < 32 


3D, vertex-based 


construction 


s « 6n 


2D 


experimentals 


s « 18n 


3D 


experimentals 


a K, 3m 


2D 


experimentals 


a ~ 6m 


3D 


experimentals 



For vertex-based SMCs, n = m = n, since (disregarding the vertices of the 
base complex) each node adds a vertex, and all such vertices form the vertices 
of the reference complex. 

We also assume an upper bound P on the number of simplices involved in 
a single update, such that < P. This is achieved by imposing con- 

straints on the mesh simplification algorithm used to build the SMC. The values 
of P = 16 in two dimensions, and P = 32 in three dimensions are reasonable in 
practice. Under this assumption, also the number of arcs entering and leaving 
each SMC node is bounded, and thus a and s are linear in m. 

It has been experimentally found that in the two-dimensional case, s w 6n 
and a « 3m; while, in three dimensions, s « 18n and a « 6m. All such relations 
are summarized in Table 2. 

Finally, we assume that vertex coordinates can be described as 32 bit floating- 
point numbers. All logarithms appearing in formulas are intended in base 2. 



4 Selective Refinement 

The data structures proposed in this paper are evaluated not only based on 
their space requirements, but also with respect to their efficiency in supporting 
the operation of selective refinement. We remark that such structures, although 
called “compressed”, are complete and effective structures on which algorithms 
for selective refinement can be run: they do not need to be decoded into a more 
explicit representation before use. 

The extraction of a complex from a SMC reduces to selecting a subset 

5 = {mi, . . . ,Uk} of the SMC nodes and performing the corresponding updates 
ui ~^ . ., Uk~^ on the base complex. Set S must be consistent with the partial or- 
der defined in the DAG: for every node u € S, all the predecessors of u must be 
also in S. A consistent set S may contain more updates in some areas, and fewer 
updates elsewhere, hence defining a complex S$ whose resolution is variable in 
space. 
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The fundamental query on a multiresolution model, called selective refine- 
ment, consists of extracting a complex E such that the resolution of E sat- 
isfies some user-defined requirements (e.g., having a certain accuracy over a 
region of interest), and E contains a minimal number of simplices. Given a 
SMC M. = {U,A), the resolution requirements for extracting a complex from A4 
are expressed by a means of a Boolean function r defined on the SMC nodes: 
t{u) = true if and only if update u is necessary for achieving the desired reso- 
lution. The value of r over an update u is application dependent. 

A selective refinement query applied to a SMC M. = {U,A), based on r, 
extracts the complex Es, associated with a consistent set S' of At, such that: 

(i) for each simplex a € Es, the update u = (u“, u+), with a G u~ , is such that 

t{u) = false. 

(ii) S is the consistent set with the minimum cardinality that satisfies the pre- 
vious condition. 

If function r is such that the set of nodes = true} is connected, then 

the result of the above query is unique. 

Algorithms for solving such query on a SMC Al = {U, A) perform a traversal 
of the DAG describing M. to find set S. The basic algorithm, that we call static, 
is based on node expansion. It starts with a set S containing just the root, and 
with Es equal to base complex. Then, it adds nodes to S as long as condition (i) 
is not satisfied. Adding a node u to S locally refines Es by applying update 
and may cause recursive insertion of ancestors of u into S. A dynamic variant of 
the algorithm starts each time from the consistent set S generated by a previous 
query, and performs node contraction prior to performing node expansion as in 
the static case: in this phase, all those nodes that are not necessary to satisfy 
condition (i) are eliminated from S. 

Details on algorithms are given in [4]. For the purpose of this paper, it is 
sufficient to notice that primitive operations needed to support such algorithms 
are concerned with traversal of dependency links in the SMC, and maintenance of 
a current status, i.e., a consistent set S of SMC nodes together with its associated 
complex Es- 

DAG traversal primitives retrieve the set of direct ancestors and direct de- 
scendants of a node u G U: 

— operation pred(M) returns the set {ui| <Ui,u>G A}. 

— operation succ(m) returns the set {ui\ <u,Ui>G A}. 

Operations pred and succ are used to test the feasibility of an update u G U 
in the current status {S, Es). It u ^ S, adding it to S produces a consistent set 
(i.e., update can be applied to Es) if and only if pred(w) C S'. If u € S', 
removing it from S produces a consistent set (i.e., update u~ can be applied to 
Es) if and only if succ(u) n S = 0. 

Finally, two operations modify the current status (S, Es) by performing a 
feasible update u~ or 

— operation refinement (u, S, A 5 ) returns a new status {S',Es>), where S' = 

S U {it} and Es> = Es \ {w~} U (if*'}. 
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— operation abstraction(u, S, Us) returns a new status {S' , Us>), where S' = 
S \ {t6} and Us> = Us \ {m^} U {«“}• 

It is important to implement the primitive operations with an optimal time 
complexity, i.e., the cost of pred and succ must be linear in the number of output 
nodes, and that of refinement and abstraction must be linear in Uu’*'). 



4.1 Encoding the Current Complex 

In this Subection, we introduce a data structure for simplicial complexes that is 
used to encode the current mesh Us during selective refinement as well as the 
final result of selective refinement algorithms. 

The simplest way to encode a simplicial complex U is through an indexed 
data structure which stores, for each fc-simplex a, references to the fc-l- 1 vertices 
of cr, while each vertex is stored as a tuple of d coordinates. A variation is 
the indexed data structure with adjacencies, that stores, in addition, the fc -I- 1 
simplices adjacent to each simplex a along a, {k — l)-dimensional facet. 

If complex U has h vertices and s simplices, then the space required for 
encoding it into this data structure is equal to dh bits for the vertex coordi- 
nates, while connectivity information requires s{k + 1) logh bits (in the variant 
without adjacencies) and s{k + l)(logh -I- logs) bits (in the variant with adja- 
cencies). We recall that s ~ 2h in two dimensions (fc = 2), and s ~ 6h in three 
dimensions (fc = 3). It follows that connectivity information requires 6hlogh 
bits (12hlogh -I- 6h with adjacencies) in the two-dimensional case. In three di- 
mensions, 24hlogh bits (48hlogh -I- 72h with adjacencies) are required. 

5 A Data Structure for General Updates 

Representing a SMC A4 = {U, A) requires encoding two kinds of information: the 
updates associated with DAG nodes, and dependency links represented by DAG 
arcs. In this Section, we describe a general-purpose SMC data structure suitable 
to represent any SMC. Such structure is called explicit because it represents an 
update u by explicitly listing the simplices belonging to sets u~ and 

5.1 Encoding Dependencies 

The dependency relation is encoded by associating with each node u the direct 
ancestors and the direct descendants of u, i.e., the two sets of nodes correspond- 
ing to the result of primitives pred(w) and succ(m). For each such set, references 
to all the updates in the set are stored as well as the cardinality of the set. 

Summing up over all nodes u € U, it results in 2m log P bits for encoding 
the cardinalities of the two sets pred('u) and succ(m), and 2a log m bits for 
referencing all the nodes in such sets. For a vertex-based SMC, the above cost 
reduces to: 

— 8n -I- 6h log n in two dimensions (P = 16, a « 3m), and 
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— 10n+ 12nlogn in three dimensions {P = 32, a « 6m). 

The implementation of operations pred and succ is straightforward, and 
their time complexity is linear in the number of the output nodes. 

Klein and Gumhold [14] propose a DAG encoding with a reduced memory 
cost. For each node u, they define a cyclic linked list, called a loop, which contains 
update u followed by all its direct ancestors (i.e., the loop of u contains the set 
of nodes defined by pred(u) U {m}). Figure 2 shows a DAG and the loops needed 
to encode it. 

A node u appears in its own loop and in all the loops defined by its direct 
descentants. For each of such loops, u stores a forward pointer (implementing 
the linked list) plus an integer, the loop identifier, which is used to distinguish 
the loop each pointer belongs to. The number of loops (i.e., #succ(m) + 1) is 
also stored at u. 

The total number of links to describe the arcs of a SMG is thus a + m. Since 
each node has at most P direct ancestors or descendants, we need logP bits for 
a loop identifier and to count the loops to which a node belongs; a pointer to a 
node requires log m bits. 

The number of bits to encode the whole DAG is m log P + (log P + log m) ( a + 
m) = m(2 log P + log m) + o(log P + log m) . In the special case of a vertex-based 
SMG, we have: 

— 20h -|- 4hlogh bits in two dimensions, and 

— 40h -|- Thlogh bits in three dimensions. 

This way of encoding the DAG supports operations pred with an opti- 
mal cost. On the contrary, implementing succ(u) needs examining all the loops 
traversing node u, which leads to a sub-optimal time complexity in the worst 
case. However, such complexity is amortized in the selective refinement algo- 
rithm; therefore this data structure exhibits an efficient expected behavior even 
in this case. 

5.2 Encoding Updates 

In the general case, each node u is represented as two sets of fc-simplices u~ and 

where simplices are given as tuples of vertices. Vertices, simplices, and nodes 




Fig. 2. A DAG (on the left) and the loops needed to encode it (on the right) 
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are stored in arrays. The simplices introduced by a given update are encoded 
in consecutive positions in the simplex array. If two nodes Ui and Uj are stored 
at consecutive positions in the node array, then the simplices of Ui^ are stored 
next to the simplices of Uj^ in the simplex array. 

For each vertex, we store its d coordinates. For each simplex, we store ref- 
erences to its fc -I- 1 vertices. For each node u, we store the number of simplices 
in u~ plus references to all simplices in u~ , and a reference to the first simplex 
in u~^ . Given a node u, all the simplices of set are retrieved by scanning the 
simplex array starting at the first simplex of u~^ and stopping at the first simplex 
of the update following u in the node array. This requires a linear time in the 
cardinality of . 

Encoding vertices requires 32dn bits. The overall size of the simplex array is 
{k + l)slogn bits. Summing up over all nodes rt S 17, it results in mlogP bits 
for encoding the number of simplices in u ~ ; s log s bits for storing all sets u ~ ; 
and 771 log s bits for referencing the first simplex in u~^ . 

Disregarding the cost of purely geometric information (i.e., vertex coordi- 
nates), we concentrate on the space needed to represent updates and the sim- 
plices involved in them. 

In the two-dimensional case, this is equal to 4r?7 -|- (s -|- m) logs -I- 3s log m 
bits, which, for vertex-based SMCs, is bounded from above by 25h -|- 25hlogri. 

In the three-dimensions, we have bm -I- (s -I- m) log s -I- 4s log m bits, bounded 
by 100h-|- Qlhlogh in the case of a vertex-based SMC. 

Assuming that the current complex Ss is encoded in an indexed data struc- 
ture without adjacencies, operation refinement (resp., abstraction) can be 
performed on Es by simply deleting the simplices of u~ (resp., u'^), and replac- 
ing them with the simplices of u~^ (resp., u~). This can be done in time linear 
in #(m“ U u~^), i.e., optimal. If needed, we can reconstruct all adjacency rela- 
tions of the output complex in a post processing stage, in computational time 
0{N log N), where N is the number of {k — l)-facets in the output complex. 

Table 3 summarizes the memory requirements of an explicit encoding of a 
vertex-based SMC in two and in three dimensions. 



Table 3. Summary of memory requirements for explicitly encoding a vertex- 
based SMC 





standard 


loop-based 




dimension 


dependencies 


dependencies 


updates 


2D 


8n -1- 6n log n 


20n -I- 4n log n 


25n -1- 25n log n 


3D 


lOn -t 12n log n 


40n -I- 7n log n 


lOOn -|- 91n log n 
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6 Compact Encoding of Vertex-Based Updates 

In this Section, we present compact data structures for representing updates in 
the class of vertex-hased SMCs in two and three dimensions. Such structures 
exploit the specific properties of such type of updates in order to reduce space 
complexity. The idea is to encode an implicit, procedural, description of an 
update, which contains sufficient information to perform operations refinement 
and abstraction on the current complex. 

If M is a vertex insertion/removal, operation refinement(tt, S, Es) reduces to 
locating a fc-dimensional polytope tTu on the current complex Es, which bounds 
the set of simplices of u~: all simplices internal to tt^ must be deleted. The 
simplices of u~^ are created by simply joining each face of tTu to vertex Vu- 
The converse operation abstraction(M, S, Es) consists of deleting all simplices 
incident in Vu from Es (i.e., simplices that form u"*"), and re-tesselating the 
resulting polytope tt^, without using any internal point. In general, tTu admits 
more than one tesselation: we must recreate the one corresponding to u~ . 

A family of encoding structures for vertex-based updates can be developed 
that do not store all the simplices in u~ and (as in the explicit data structure), 
but just those information, that are sufficient to perform the following two tasks: 

1. Given the current complex Es and an update u, such that u~^ is feasible, 

recognize the simplices forming u~ among those of Es- 

2. Given the polytope tTu bounding u~ , build u~ . 

In all data structures we present. Tasks 1 and 2 need a boundary (fc — 1)- 
dimensional facet 7 ^ of tTu as a starting point. When either update u~^ or u~ 
is to be performed on the current complex Es, we need to identify facet ju on 
Es- The way to identify it is different in case we are going to perform u~^ (i.e., 
u~ C Es) or u~ (i.e., C As). In both cases, we rely on labelling all simplices 
of the current mesh u~^ with integer numbers. 

In the case of a dynamic algorithm for selective refinement, a simplex a G Es 
can have been generated either during expansion or during contraction. Simplex 
(T is labelled with one bit to discriminate between the two cases, and with an 
integer on log P bits which uniquely identifies a among the simplices that have 
been introduced in Es together with a. This label is generated in the following 
way. When we perform an update u~^ on Es, we label the new simplices (i.e., 
those of M+) in an (arbitrary) order which is always the same every time the 
update is performed, and such that simplex cr„+ containing 7 ^ is the first one. 
When we perform an update u~ , we label the new simplices (i.e., those of u~) in 
an (arbitrary) order which is always the same every time the update is performed, 
and such that simplex a^- which contains facet is the first one. Figure 3 
shows a two-dimensional example of simplex labelling in the two cases. In a 
static algorithm, we have a simplified case since contraction is never performed 
and, thus, all simplices are generated during expansion. 

All structures that we present share a common framework that is described 
below. They all store the vertices of the SMG, each as a tuple of coordinates, 
with a cost of dn bits. Since each update u corresponds to a vertex Vu, updates 
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Fig. 3. (a) The current state of a selective refinement algorithm seen on the MT: 
the thick line separates nodes in S from nodes not in S, denoting that u' € S 
and u ^ S. (b) The current complex in case is generated by executing . 
(c) The current complex in case is generated by u~ 



and vertices are re-numbered in such a way that a node u and its corresponding 
vertex Vu have the same label. Thus, the relation between and Vu is encoded 
at a null cost. 

Moreover, for each update rt, we encode information needed to retrieve 7^ 
when we are going to perform (i.e, when u~ C Eg)'- 

— an index on log P bits that identifies the simplex € u~ such that 7^ is 
a facet of cr„- ; 

— an index on log(fc -|- 1) bits that identifies 7„ among the k + 1 {k — l)-facets 
of cr„- ; 

and information to retrieve 7^ when we are going to perform u~ (i.e, when 
M+ C Es'. 

— an index on log P bits that identifies the simplex ct„+ G u~^ such that 7„ is 
a facet of ct„+ ; 

Note that we do not need an index to identify 7^ among the (fc — l)-facets of 
cFu+i since 7^ is automatically identified as the facet of cr„+ opposite to Vu- 
Encoding such information requires 2 log P -I- 2 bits per update, i.e., lOn bits 
in two dimensions and 12n bits in three dimensions. 

In order to retrieve cr„- from the index stored in u, we need to know the 
update m', among the direct ancestors of u, such that G u'~^. Such update 
u' is encoded at no cost by adopting the convention that, in DAG encoding, 
pred(u) contains u' in the first position. 

The index stored with an update u to identify cr„- is the same as the number 
that cr„- receives when cTj;- is created in the current complex by performing 
update u’'^ (in Fig. 3b, this index is 4). This information, together with the 
node u' such that cr„- S u'^ ^ enables us to retrieve cr„- on the current mesh, 
in case is in Es because it has been created by performing u’'^ . In case 
is in Es because it has been created by performing u~ , then cr„~ is simply 
retrieved as the simplex having the first label among the ones generated by u~ . 
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In order to retrieve cr„+ from the index stored in u, we need to know the 
update u” , among the direct descendants of u, such that a^+ G u” . Such 
update u" is encoded at no cost by adopting the convention that, in DAG en- 
coding, succ(tt) contains u" in the first position. If the compact DAG encoding 
is adopted, then the loop corresponding to u” is the first one among the loops 
traversing u, after the loop of u itself. 

The index stored with u to identify simplex cr„+ is the same as the number 
that cr„+ receives when cr„+ is created in the current complex by performing 
update u" . This information, together with the node u” such that cTu+ G u" , 
enables us to retrieve cr„+ on the current complex Ag, in the case in which 
fj^+ has been created by performing u" . If cr„+ is in the current complex Ag 
because it has been created by performing then cr„+ is simply retrieved as 
the simplex having the first label among the ones generated by . 

In order to perform operations refinement and abstraction, the current 
complex Ag needs to be encoded into an indexed structure with adjacencies (see 
Section 4.1). 

The remaining information stored at a node u, i.e., the encoding of u~, are 
described in the following subsections for the two- and for the three-dimensional 
case, respectively. 

6.1 2D Vertex-Based Updates 

To encode the triangulation u~ of a node u, a stream of two bits for each triangle 
can be used, following an approach proposed by Taubin et al. [22] in the context 
of progressive mesh compression. 

The construction of the bit stream is performed through a recursive procedure 
that encodes one triangle a G u~ at a time. At each recursive call, the algorithm 
maintains a current triangulation T and a current edge e on its boundary, and 
generates the code for a triangle a bounded by e. At the beginning, the current 
triangulation T is initialized as u~ and the current edge e as Triangle a is 
the triangle of u~ lying on the left side of e. The algorithm looks at the two 
edges of triangle cr different from e and at the two parts in which such edges 
partition the current triangulation in order to assign a code to a and to decide 
about recursive calls to be made: 

— If both parts are non-empty, then code 00 is added to the bit stream. 

— If only the left part is non-empty, then code 01 is added. 

— If only the right part is non-empty, then code 10 is added. 

— If both parts are empty, then code 11 is added. 

Then, a recursive call is performed on each of the non-empty parts, starting from 
the boundary edge it shares with triangle a. On each branch, the recursion stops 
when a code 11 is generated. Figure 4a shows the codes generated for a sample 
mesh u~ . 

Encoding an update u, such that u~ contains h triangles, requires 2h bits. 
The value of h does not need to be encoded since the structure of the bit stream 
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Fig. 4. (a) Codes assigned by Taubin’s method to the triangles of u ; arrows 
correspond to the traversal of u~ . (b) Topological reconstruction of u~ 



(basically, a preorder visit of the recursion tree, where leaves are characterized 
by code 11) can be exploited to reconstruct its length. The total length af all 
bit streams is equal to ~ ~ which is about 8h in our case 

(two-dimensional, vertex-based SMCs). 

In order to perform task 1 in operation refinement, we run the same recur- 
sive procedure described above, starting from edge 7 ^ on the current mesh Us 
as our working edge e. In this case, the bit stream is known, and we need to 
recover u~. At each call, we consider the current working edge e, and recognize 
the triangle a, adjacent to e, as belonging to u~. Then, depending on the code 
found in the stream, we decide, for each edge of a, different from e, if a recursive 
call must be invoked on that edge. The recursive calls progressively “consume” 
the bit stream, so that each call always reads the next code which has not been 
read by any call executed before it (either at a deeper or at an outer recursion 
level) . 

Task 2 in operation abstraction is performed in a very similar way. In this 
case, at each step, we generate a triangle cr, instead of recognizing it on the 
current mesh. Note that this procedure allows reconstructing just the topology 
of triangle mesh u~ : when we create a triangle cr, we do not know which vertex 
of 7 T„ must become the third vertex of cr. However, when the topology of u~ 
has been built, it is easy to map the vertices of 7 r„ onto it (see Fig. 4b). This 
simply reduces to a synchronous traversal of 7 r„ and of the boundary of the 
triangulation, starting at one of the two vertices of 7 „ (for which the mapping 
is known). 

For each node u, the computational cost of performing either task 1 or task 
2 , with the methods described above, is linear in the number of triangles of u~ . 

6.2 3D Vertex-Based Updates 

In the three-dimensional case, tetrahedralization u~ is described as a tetrahedron 
spanning tree rooted at cr^- , which is encoded as a bit stream. The tetrahedron 
spanning tree is constructed as follows. We start from cr,^- S u~ , and traverse all 
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Fig. 5. The tetrahedron spanning tree 



tetrahedra of m in a breadth-first fashion. Each triangular facet of a traversed 
tetrahedron is labeled in the following way: 

— with 0, if the facet is a face of polyhedron 

— with 10, if the facet is adjacent to a tetrahedron that belongs to u~ and has 
not yet been traversed; 

— with 11, if the facet is adjacent to a tetrahedron that belongs to u~ and has 
already been traversed. 

Three facets are labelled for each tetrahedron. This is also true for CTu-, since 
we know that one of its facets (i.e., 7^) is on tt^. An example of a tetrahedron 
spanning tree with facet labelling is shown in Fig. 5. 

If u~ contains h tetrahedra and has q external facets, then the stream con- 
tains q—l + 2{3h— (q — 1)) = 6h — q+l bits . Note that the number q of external 
facets is equal to the number of tetrahedra in u~^. The bitstream length does 
not need to be stored since the structure of the bit stream can be exploited to 
reconstruct its length. 

By summing up over all nodes, we have -I- 771 

= 6(s — s) — s -|- TO bits, which, in the case of three-dimensional vertex-based 
SMCs, reduces to 6(18h — 6n) — 18h + n = 55n bits. 

In order to perform Task 1 in the refinement operation, we start from cr„- , 
and use the bit stream as a mask to retrieve all tetrahedra of u~ by travers- 
ing them in the same sequence as they have been traversed for creating the 
tetrahedron spanning tree. 

In order to perform Task 2 in the abstraction operation, we start from 
and scan the bit stream to recreate the tetrahedra of the spanning tree one at 
a time. Each time we traverse a facet marked 0, we glue it to a face of 7r„; each 
time we traverse a facet marked 10, we create a new tetrahedron of u~; each 
time we traverse a facet marked 11, we glue two tetrahedra of u~ at that face. 
The other face to be glued (in cases 0 and 11) is uniquely determined because 
the tetrahedralization we are creating has no internal vertices. 
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7 Comparisons 

For the purpose of comparison, we consider the space required to encode the 
reference complex in an indexed data structure with adjacencies (see Section 4.1). 
The space required by the reference complex with such structure is equal to dfi 
bits for the vertex coordinates, while connectivity data require 6h -I- 12hlogh 
bits in two dimensions, and 72h -|- 48hlogh bits in three dimensions. 

The storage cost for vertex coordinates is the same, i.e, dn bits, also in all 
SMC data structures. Encoding the partial order (i.e., the DAG) of a SMC in the 
standard way requires 8h -I- 6fi log h bits in two dimensions, and lOh-l- 12hlogh 
bits in three dimensions. With compact DAG encoding based on loops, this cost 
becomes 20n + 4h log h bits in two dimensions, and 40h -|- 7h log h bits in three 
dimensions. 

The amount of space needed to encode the updates through the explicit data 
structure is equal to 25h -I- 25hlogh in two dimensions, and lOOh -I- 91hlogh 
in three dimensions. Adopting the compact encoding of updates described in 
Section 6 reduces the cost of encoding all updates to 14h and 62h in the two 
and in the three dimensional case, respectively. 

In Tables 4 and 5, we compare the storage requirements of the reference 
complex, of the explicit data structure, and of the data structure using com- 
pact update encodings, in two and three dimensions, respectively; for the two- 
dimensional case (see Table 4), we also show the cost of the data structure 
proposed by Klein and Gumhold in [14]. We have considered vertex-based SMCs 
with lOK, lOOK, 500K, IM and 2M vertices. The costs are expressed as number 
of bits per vertex; geometric information (i.e., vertex coordinates) are equal in 
all structures and, therefore, are not considered in the tables. The number of 
bits shown for the data structure of Klein and Gumhold does not include the 
representation of equivalence classes of triangulations. In [14], the authors report 
that, for lOOK vertices, 120 bits per vertex are necessary, which allows thinking 
that, in general, the total cost of the structure is about twice the number of bits 
reported in the last column of Table 4. 



Table 4. Gomparisons of data structures in the two-dimensional case. We show 
the sizes of the reference complex, of explicit SMG data structure with both 
variants of DAG encoding (standard or compact) , of the data structure for SMGs 
with compression of updates, with both variants of DAG encoding, and of the 
data structure in [14] 



n 


refer. 

complex 


expl. SMC 
std. DAG 


expl. SMC 
comp. DAG 


comp. SMC 
std. DAG 


comp. SMC 
comp. DAG 


[14] 


lOK 


174 


467 


451 


110 


94 


117 


lOOK 


210 


560 


538 


128 


106 


135 


500K 


234 


622 


596 


140 


114 


147 


IM 


246 


653 


625 


146 


118 


153 


2M 


258 


684 


654 


152 


122 


159 
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Table 5. Comparisons of data structures in three dimensions. We show the sizes 
of the reference complex, of explicit SMC data structure, with both variants of 
DAG encoding, of the data structure for SMCs with compression of updates, 
with both variants of DAG encoding 



n 


refer. 

complex 


expl. SMC 
std. DAG 


expl. SMC 
comp. DAG 


comp. SMC 
std. DAG 


comp. SMC 
comp. DAG 


lOK 


744 


1552 


1512 


245 


205 


lOOK 


888 


1861 


1806 


281 


226 


500K 


984 


2067 


2002 


305 


240 


IM 


1032 


2170 


2100 


317 


247 


2M 


1080 


2273 


2198 


329 


254 



We compare the data structure adopting a compact representation of updates 
against an explicit one, when the DAG is encoded in the same way. We have that, 
in two dimensions, the compact version saves about 77% — 78% with respect to 
the explicit one (with standard DAG encoding), and 80% — 82% (with compact 
DAG encoding). In three dimensions, we have a 85% — 86% saving with standard 
DAG encoding, and a 87% — 89% saving with compact DAG encoding. 

Now, we compare the two structures using compact update encoding with 
respect to the reference complex. In two dimensions, such structures save about 
37% — 42% (with a standard DAG representation), and about 46% — 53% (with 
a compressed DAG); the memory reduction increases for larger values of h. In 
three dimensions, the above figures are 68% — 70% (with standard DAG) and 
73% — 77% (with compressed DAG). 

In two dimensions, the use of a compact DAG representation gives a storage 
compression of about 5% (with explicit update encoding) and 15% — 20% (with 
implicit update encoding) , where the best reduction is achieved for larger values 
of n. In three dimensions, the compression rate is about 4% (with explicit update 
encoding) and 18% — 23% (with implicit update encoding). 

An explicit encoding of updates allows a lighter working data structure for 
the current mesh during selective refinement, since maintaining adjacencies is 
not needed. But if adjacencies are required on the output complex, then there 
is an overhead to reconstruct them as a post-processing step. Implicit update 
encodings need to maintain adjacencies during selective refimenent, but they 
produce an output complex with adjacencies at no additional cost. 

8 Concluding Remarks 

In this paper, we have considered a specific class of multiresolution simplicial 
complexes, built based on a vertex-removal/insertion simplification strategy in 
two and three dimensions. We have described compact encoding structures for 
such SMGs. SMGs built through vertex removal from a mesh at the maxi- 
mum resolution, or through vertex insertion from a mesh at a coarse resolu- 
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tion are extensively used in applications for modelling surfaces and two- and 
three-dimensional fields. 

The resulting data structures are particularly interesting since we have shown 
that they achieve a compression of almost 80% (in two dimensions) and 87% (in 
three dimensions) with respect to a general-purpose structure for encoding the 
SMC and a compression of almost 50% (in two dimensions) and 70% (in three 
dimensions) with respect to encoding the mesh at the maximum resolution. 

It would be interesting to compare, in terms of performances, a vertex-based 
SMC described as a compressed data structure with hierarchical data structures 
based on regular recursive subdivisions even if these latter have a more restricted 
applicability (they can be used only with regularly distributed data points and 
not for free-form surfaces). 

Further developments of the research described in this paper are investigating 
issues related to the study of more compact data structures for storing the DAG, 
the design of a vertex-based update encoding in the arbitrary fc-dimensional case, 
and the development of out-of-core data structures for a vertex-based a SMC as 
well as out-of-core algorithms for selective refinement. 
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Abstract. We introduce a polyhedral representation of surfaces for anal- 
ysis and recognition of three-dimensional digital images. Our represen- 
tation is based on combinatorial topology. By using a discrete version of 
combinatorial topology we also present an algorithm for reconstruction 
of a polyhedron in a discrete space from a set of lattice points. 



1 Introduction 

In this paper, given a 3-dimensional digital image, we present a method to recon- 
struct a polyhedron for a 3-dimensional object in the image. Such polyhedriza- 
tion problem is one of the fundamental problems in the fields of digital image 
analysis, recognition and visualization, and has been called the border tracking 
problem [1,2,3]; sometimes the words of “border” and “tracking” are replaced 
by other words such as “boundary” or “contour” and “following”, respectively. 

We state our polyhedrization problem more precisely and mathematically. 
Let us consider a 3-dimensional discrete space such as the set of all lattice 
points whose coordinates are all integers in the 3-dimensional Euclidean space 
R3 as the mathematical space model for a 3-dimensional digital image. Given a 
subset V of for a digitized 3-dimensional object, we consider construction of 
a polyhedron such that 

(A) the vertices of the polyhedron are all lattice points which are the border 
points of V in the sense of general topology /set theory [4]; 

(B) any adjacent vertices of the polyhedron are neighboring in the sense of the 
traditional 6-, 18- or 26-neighborhood system for digital image analysis [3]; 

(C) it has the topological structures of two-dimensional surfaces in the sense of 
combinatorial and algebraic topology [5,6]. 

We call such a polyhedron a discrete polyhedron hereafter. 

In the following sections, we first give the classical set-theoretical definition 
of border points of V C Z^ [3,7,8] and see that the topological surface structures 
may be useful for border tracking in Z^. Among the various approaches for defin- 
ing discrete surfaces in Z^, we then choose an approach based on combinatorial 
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thanks to the JSPS postdoctoral fellowships for research abroad from October 2000. 
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and algebraic topology [5,6]. To construct a discrete polyhedron satisfying the 
conditions of (A), (B) and (C) in Z^, we need to build up the discrete version of 
combinatorial and algebraic topology. We define discrete cells of the dimension 
from 0 to 3 under the conditions (A) and (B) and propose an interpretation of 
a digitized 3-dimensional object V as a cellular complex consisting of discrete 
cells. The same concept for the image interpretation by cellular complexes is 
already introduced by several authors such as [7,9]. In [9], for instance, only 
the 6-neighborhood system is considered while we consider either of 6-, 18- or 
26-neighborhood system. Thus, we say that our framework is more general in 
the sense of neighborhood systems. In [7], 3-dimensional discrete cells which 
are called “continuous analogs” are defined in a different way from ours for the 
different purpose which is the digital fundamental group. 

For practical use, we also present an effective polyhedrization algorithm to 
generate a discrete polyhedron directly from a given subset V of similarly 
to the marching cubes algorithm [10]. We have presented a similar algorithm 
in our previous work [11], but the theory is limited only for the cases of 6- 
and 26-neighborhood systems. By using cellular complexes instead of simplicial 
complexes for a 3-dimensional image interpretation, we succeed to extend our 
work for all 6-, 18- and 26-neighborhood systems. Finally, we clarify the the 
relation between a set of boder points based on set theory and our discrete 
polyhedron. 



2 Border Points and Surfaces in 
2.1 Border Points in Z^ 

Similarly to the definition of border points in [4] , a set of border points of 
a finite subset V in Z^ is defined by using a neighborhood system. We have 
the TO-neighborhood system in Z^ such as 

Nm(a:) = {y e : ||a; - y|| < Vt}, (1) 

where t = 1,2,3 for each m = 6, 18, 26. If Nm(:r) C V, then such a point x is 

called an interior point of V [3,8]. The set of interior points of V is called the 
interior of V and denoted by 

Intm(V) = {x€V: N„(a;) C V}. 

Clearly, Intm(V) depends on the value of m for m = 6, 18, 26. If a point a; € V 
is not an interior point of V, then x is called a border point of V [3,7,8] and the 
set of all border points is denoted by 

Br„(V) = V \ Intm{V),= {xgV: N„(a^) C V ^ 0}. (2) 

Let us say that Brm{y) is connected or m'-connected if any pair of a;,y S 

Brm{y) has a point sequence X\ = a;, X 2 , . . . ,Xk = y such that all Xi € Brm{y) 
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and G Nm'(xi). It is known that Bre(V) is 18- or 26-connected and 

-Bris(V) and Br 2 e(V) are 6-connected [7]. Note that m! . 

Let us consider boundary points. Roughly speaking, in Z^, a finite subset 
V and the complement V have “different boundaries” even if there exists “the 
common boundary” in [4]. It is easy to see that Brm(V) r\Brm(V) = 0 from 
(2). Clifford explained such “different boundaries” using an example such as a 
heap of white marbles on the top of which black marbles are put in [12]. He 
declared that the boundary of the white part would be a layer of white marbles 
and the boundary of the black part would be a layer of the black marbles, that is, 
the two adjacent parts have different boundaries when they are divided into two 
parts. We admit the existence of “different boundaries” in as well as other 
work such as [1,8]. We focus on one of the “boundaries” which is the border 
Brm{y) of V and construct a discrete polyhedron such that all the vertices are 
in Brm{y)- In the historical sense, such problem is related the border tracking 
problem presented in the next subsection. 

2.2 Border Tracking in Z^ 

One of several well-known approaches for border tracking in Z^ is tracking the 
common faces between two unit cubes centered at the points p in V C Z^ and 
q in V = Z^ \ V. Such faces are represented by the ordered pair (p, q) [2]. Since 
q € Ng(p) n V, the set of all p of such pair (p, q) becomes equal to Br^iY) of 
(2). In this approach, the “surface” is represented by a set of square faces so that 
the square faces are of cubes whose centroid are lattice points p. The topological 
structures of such “surfaces” are given as cellular complexes in [13]. However, the 
vertices of a polyhedron are not lattice points because the faces exist between 
lattice points. Since we would like to take the purely discrete approach, we first 
consider how to to construct a topological structure for 2-dimensional surfaces 
only using lattice points in Z^, and then clarify the relation between such surfaces 
and the set-theoretical border points. 

2.3 Definitions of Surfaces in Z^ 

The approaches for defining 2-dimensional surfaces in Z^ are mainly classified 
into the following three types: 

1. the approach based on Jordan surface theorem [-5] so that a set of lattice 
points is defined as a discrete surface if it divides T? into two regions and 
the thickness of the set is as thin as possible [14,15]; 

2. the approach based on algebraic topology [6] so that the 2-dimensional sur- 
faces is defined as the boundary of 3-dimensional objects [2,8,11,13]; 

3. the approach by using the topological structures of the 2-dimensional man- 
ifold [16]. 

In fact the first approach is considered for the different problem from our 
problem. Our problem is construction of a polyhedron from a subset V in T? 
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1 

Fig. 1. Examples of non-manifold polyhedral surfaces 

although the problem for the first approach is determining a set of lattice points 
in corresponding to a given 2-dimensional surface in R^. It is sometimes 
called digitization or discretization. The third approach only allows us to treat 
the surfaces which is topologically equivalent to the 2-dimensional manifold. 
Because we would like to treat non-manifold surfaces as shown in Figure 1, we 
take the second approach for our definition of discrete polyhedral surfaces. 

3 Combinatorial Definition of Discrete Polyhedra 

3.1 Discrete Cellular Complexes 

Referring to combinatorial and algebraic topology in the n-dimensional Eu- 
clidean space R" [5,6], we first define n-dimensional cells or n-cells and then 
form cellular complexes by gluing cells. 

Definition 1. An n-cell is a set whose interior is homeomorphic to the n- 
dimensional disc Ur(a;) = {y € R" : jja; — yjj < r} with the additional property 
that its boundary must be divided into a finite number of lower dimensional cells, 
called the faces of the n-cell. We write a < t if a is a face of r. 

0. A 0-cell is a point A. 

1. A 1-cell is a line segment a = AB, and A < a, B < a. 

2. A 2-cell is a polygon (often a triangle), such as a = A ABC, and then 
AB, BC, AC < a. Note that A < AB < a, so A < a. 

3. A 3-cell is a solid polyhedron (often a tetrahedron) , with polygons, edges, and 
vertices as faces. 

The point of a 0-cell, the endpoints of a 1-cell and the vertices of 2- and 3-cells 
are called the vertices of each cell. 

Definition 2. A complex K is a finite set of cells such that 

1. if <7 G K, then all faces of a are elements o/K; 

2. two cells tr, T G K are either disjoint or their intersection is a common face. 

The dimension o/K is the dimension of its highest- dimensional cell. 

An n-dimensional complex or n-complex K is said to be pure if every cell 
with less than n dimension of K precedes some n-cell [5] . 
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Table 1. All discrete n-cells for n = 0, 1, 2, 3 such that the vertices are all lattice 
points in and the adjacent vertices are m-neighboring for m = 6, 18, 26 



discrete cells 



dim. 


Ne 


Ni8 


N26 


0 




'i 


5- 


^'M 


1 




P2ai P2bi 

mm 


P2ai P2bi P2ci 


2 




1 


P3C| P4fi 


3 


P3bA P3c^k P4a| P4f t 


3 




1 

1 

2 


P4gi P5b| P5c* 

P6bi P6ct P7 t 

5 - 


P4bi P4ci P4di -P4e* P4gA 

P5ai P5bi P5ci P6^ P6b* 



In this paper, we consider complexes such that the vertices of cells in com- 
plexes are all lattice points in and the vertices of any 1-cell in complexes 
are neighboring in the sense of the 6-, 18- or 26-neighborhood of (1). For con- 
structing such complexes, we first list up all possibles cells whose vertices are all 
lattice points and the adjacent vertices of a cell are m-neighboring each other 
for m = 6, 18, 26. Because of the constraint of m-neighboring, we set a unit cube 
whose vertices are all lattice points to be the maximum size of the cells. In other 
words, we look for the cells which exist in a unit cube. 

For each lattice point in a unit cube we assign the value of either 1 or 0 
and call the point a 1- or 0-point, respectively. There are 256 arrangements of 
1- and 0-points for the eight lattice points in a unit cubic region. In fact, we can 
reduce the number from 256 to 23 with considering the congruent arrangements 
by rotations. For each arrangement, we obtain a convex polyhedron such that the 
vertices of the polyhedron are 1-points. We then classify each polyhedron into 
a set of n-cell with the dimension of n = 0, 1, 2, 3 and with the m-neighborhood 
relations between the adjacent vertices for m = 6, 18,26 as shown in Table 1. 

From Table 1, we see that the finite number of cells are obtained under the 
constraints such that the vertices of cells in complexes are all lattice points in 
Z^ and any adjacent vertices are m-neighboring for m = 6, 18, 26. Hereafter, we 
call the cells in Table 1 discrete cells. We also call a complex which is a finite 
set of discrete cells a discrete complex. 
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3.2 Definition of Discrete Polyhedra: Boundary of Discrete Objects 

For defining discrete polyhedra, we need some basic topological notions. Let K 
be a complex. If Kq is any subcomplex of K, the complex consisting of all the 
elements of Kg and of all the elements of K which precedes (i.e. is less than) at 
least one element of Kg is called the combinatorial closure CT(Kg) of Kg in K 
[5]. Now we give the combinatorial definitions of 3-dimensional discrete objects 
and the boundary. 

Definition 3. Any pure discrete 3-complex is a discrete object. 

Definition 4. Let O be a discrete object and H be the set of discrete 2-cells 
in O each of which precedes exactly one discrete 3-cell in O. The combinatorial 
boundary of O is defined as 

do = Cl{U). 



Our definition is based on the definition of the boundary in algebraic topol- 
ogy [6]. For the boundary of an n-complex, we need the concept of oriented 
(n— l)-cells. Each discrete 2-cell which is a face of a discrete 3-cell has a particu- 
lar ordering of its vertices so that the order is counterclockwise from a viewpoint 
exterior the discrete 3-cell. The boundary of a discrete object is obtained by 
the operation which is called the “modulo 2 union” of oriented discrete 2-cells. 
Therefore, a discrete 2-cell which precedes two discrete 3-cells in O is not con- 
sidered to be in dO because it means that there are two oppositely oriented 
discrete 2-cells and the modulo 2 union of them is empty. From Definition 4, the 
boundary dO is a 2-dimensional pure discrete subcomplex of a discrete object 
O. Hereafter, we call the boundary of a discrete object a discrete polyhedron. 
Note that a discrete polyhedron can be a non-manifold as shown in Figure 1 
according to the definition. Clearly, discrete complexes, objects and polyhedra 
are defined for each m-neighborhood system, m = 6, 18, 26. When we would like 
to insist that an m-neighborhood system is considered, they are denoted by C^, 
Om and dOm instead of C, O and dO. 

4 Polyhedrization of a 3D Lattice Point Set 

Given a finite subset V of Z^, we consider polyhedrization of V. The procedure is 
divided into three stages: decomposition of V into discrete cells whose dimensions 
are from 0 to 3 such that those discrete cells constitutes a discrete complex C^, 
making pure to obtain a discrete object O^, and extraction of the boundary 
dOm of Om for each m-neighborhood system, m = 6, 18, 26. We also present a 
direct algorithm generating dOm from V. In this section, we will omit m when 
it is clear that one of the 6-, 18- and 26-neighborhood systems is considered 
through the procedure for polyhedrization. 
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(a) (b) 



Fig. 2. (a) Another possible discrete complex of the 18-neighborhood system for 
the arrangement P5a and (b) an example of the problem which may be caused 
by the cellular decomposition (a) during construction of a cellular complex 



4.1 Cellular Decomposition of a 3D Lattice Point Set 

The decomposition of V into discrete cells is achieved in two stages. For x = 
{i,j, k) in Z^, let 

D(a?) = {(i -k ei,j + t 2 ,k+ £ 3 ) | = 0 or 1}. 

We say that the points of V are 1-points and the points of \ V are 0-points. 
For each a; e Z^, let C(a;) denote the discrete complex generated by the set 
of 1-points in D(a;) as shown in Table 2. Finally, let 

C= U C(a;). (3) 

ccGZ3 

We will now prove that C is a discrete complex. Say that C(a;) and C(y) are ad- 
jacent if D(a;) nD(y) yf 0. Their adjacency types are classified into the following 
three cases 

#(D(a;) n D(y)) = 1,2 or 4 (and never 3) 

where #(A) represents the number of elements of the set A. The adjacency types 
and the conceivable cellular decomposition at the joint are illustrated in Table 3. 
For each adjacent pair of C(a;) and C(y), let C(a;, y) = C{x) U C(y). We then 
verify, from Table 2, that C{x, y) is a discrete complex satisfying the conditions 
of Definition 2. 

For the arrangement P5a in the case of 18-neighborhood system, we do not 
take the discrete complex as shown in Figure 2 (a) to avoid the problem such as 
Figure 2 (b) which may occur in the procedure of constructing a cellular complex. 
Instead of Figure 2 (a), we have the discrete complex for the arrangement P5a in 
Table 2 (b). Consequently, we see that we can uniquely obtain C (or Cm) from 
(3) which is a discrete complex for any m-neighborhood system, m = 6, 18,26, 
given a finite subset V C Z^ . 

4.2 Construction of a 3D Pure Complex 

Assume that the dimension of C is three. Let G to be a set of all discrete 
3-cells in C. In order to obtain a pure discrete complex O from C, we remove 
all discrete n-cells which are not included in any discrete 3-cells in C for every 
n < 3. Thus, the pure discrete complex O is obtained such that 



O = Cl{G). 



(4) 
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Table 2. A discrete complex C{x) corresponding to each arrangement of 
1-points in a discrete unit cube D(a;) for the (a) 6-, (b) 18- or (c) 26-neighborhood 
system 



la\ I 

\°) 1 -points I 



discrete complex 



# of 

1 -points 



discrete complex 



P5a : • P5b : - P5c, , ^ 

m m m 






jtL 



P4at , P4bt , P4ct , P4d 



I P4el ^ P4f 



a unit cube 
a 1 -point 
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Table 3. Three adjacency types of two unit cubic regions D(a?) and D(y) such 
that #(D(a;) n D(y)) = 1, 2 or 4. For each adjacency type, all possible arrange- 
ments of 1- and 0-points and the cellular decomposition are shown 




If C is less than three dimensions, G is empty and thus O is also empty. From 
(4), it is clear that a discrete object O is uniquely obtained from a discrete 
complex C. 

4.3 Boundary Extraction of a 3D Pure Complex 

From Definition 4, the boundary dO of a discrete object O is derived from the 
set H of discrete 2-cells in O each of which precedes exactly one discrete 3-cell 
in O. Because H is uniquely obtained from O by using the boundary operation 
defined in algebraic topology [6], the boundary 50 is also uniquely obtained 
from O. From the above procedure for polyhedrization, we consequently obtain 
the following proposition. 

Proposition 1. Given a finite subset V C Z^, a discrete polyhedron dOm, 
or the boundary of a discrete object Om, is uniquely obtained for any 
m-neighborhood system, m = 6, 18, 26. 

4.4 Discrete Polyhedrization Algorithm 

For practical use, we present an effective algorithm such that we obtain a dis- 
crete polyhedron dOm directly from a given finite subset V C for each m- 
neighborhood system, m = 6, 18, 26. In other words, we would like to skip the 
conversion steps such as a discrete complex and a discrete object be- 
tween V and dOm because we do not need them. Similarly to the marching 
cubes method [10], our algorithm generates dO^ directly from V by referring 
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/j 






^7 







Fig. 3. An example of the case in which two adjacent unit cubes have the com- 
mon 2-cell 

to Table 4 for each neighborhood system. We derived the table by looking for 
discrete 2-cells which may be in the boundary dOm and by ignoring other dis- 
crete 2-cells in Table 2. The arrow of every discrete 2-cell in Table 4 shows the 
exterior orientation of dOm', aU discrete 2-cells are oriented as we discussed in 
subsection 3.2. In other words, for each unit cube in Table 2, we choose only a 
oriented discrete 2-cell such that the exterior side is in the unit cubic region for 
obtaining Table 4. 

Algorithm 1 

input: A subset V o/ W = {(*, j, fc) G : 1 < i < L, 1 < j < M, I < k < N}. 
output: A discrete polyhedron dOm for an m-neighborhood system, m = 
6,18,26. 

begin 

1 . for l<fc<A — Ido 

for 1 < j < M — 1 do 
for 1 < i < L — \ do 

1.1 for X = (i,j,k), obtain a pure discrete 2-complex Cm{x) at a unit 
cubic region D(a:) by referring to Table 4; 

1.2 for each y = {i — 1, j, k), (i,j — 1, k), {i,j, k — 1), check if any com- 
mon discrete 2-cell a of Cm{x) and Cmiv) exists as shown in Fig- 
ure 3; if so, replace Cm{x) and Cmiu) with Cl{Cm{x) \ Cl{{a})) 
and Cl{Cm{y) \ Cl{{a})), respectively; 

2. obtain dOm = Cm(x). 

end 

We see that discrete 2-cells which locate at the joint of two adjacent unit 
cubic regions remain in Table 4 such as the arrangements P3a and P4a. If such 
discrete 2-cells are obtained so that they are not included in any discrete 3-cell 
of a discrete object Om, then we will have pairs of the discrete 2-cells which have 
the same vertices but the opposite orientations as shown in Figure 3. They do 
not constitute dOm due to the boundary operation. The step 1.3 in Algorithm 1 
is for the removal procedure of such discrete 2-cells. 

5 Relations between Border and Discrete Polyhedra 

Let us define the skeleton Sk{a) of a discrete cell a such as the set of the 
vertices of a [5]. Given V C Z^, let dOm be a discrete polyhedron constructed 
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Table 4. The look-up table which provides the one-to-one correspondence be- 
tween the arrangements of 1- and 0-points in a discrete unit cubic region and 
a 2-dimensional pure discrete subcomplex constituting a discrete polyhedron for 
each neighborhood system 




by Algorithm 1 for m = 6, 18,26. We then call the union of the skeletons of all 
discrete cells of dOm the skeleton of dOm and it is denoted by Sk{dOm)- Let 
us compare the skeleton Sk{dOm) with the border point set Brm'iy) at each 
discrete unit cube D(a;) seeing Tables 2 and 4. For instance, the comparisons 
for the arrangements P5a and P7 are shown in Figure 4. After unifying the 
comparison at every D(a;), we consequently obtain 



Br^{Y) = Sk{d02e) U Sk{C 26 \ Oae) (5) 

= Sk{dOis)USk{C,s\Ois)\Eis, (6) 

Sri8(V) = Sk{dOe) U Sk{Ce \ Og) \ Fig, (7) 

Br26{V) = Sk{dOe)USk{Ce\Oe). ( 8 ) 



In (6) and (7), we set Eig and Fig to be the sets of points each of which is given 
as shown in Figure 4. The points of Eig and Fig in fact show the difference 
between Sk{dOis) and Sk{d026) of (5) and (6) and the difference between 
Bris(V) and Br 26 {V) of (7) and (8), respectively. A discrete complex \ 
represent less than 3-dimensional parts in Cm- Such parts are ignored in the 
procedure for polyhedrization because of the lack of the dimensions. Thus, some 
points of Sk(Cm\Om) are not in Sk(dOm) even if they are in the corresponding 
Brm'iY). 
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1 -point 


Brm(V) 


Sk(^ Om) and Sk(Cm\Om) 


point set for 


arrangement 


m=6 m=18 m=26 


m=6 m=18 m=26 


adjustment 




^ ^ ^ 


^ .m ~ 






.---I p=f ^---1 p# 




b18^^p?» 


"I® 


I® ffliO 


I® I® I® 


F18^^ 



® a point of Brm(V) ® a point of Sk(d Om) 

o a point of Sk(Cm\Om) 

Fig. 4. For each of the arrangements P5a and P7 of 1-points (black points), 
the border points (gray points) i?rm(V) for m = 6,18,26, the skeletons of the 
discrete polyhedra (gray points) Sk(dOm), the points (white points) of Sk{Cm\ 
Om) for m = 6, 18,26, and the point sets Eig and Fig for adjusting (6) and (7) 



6 Conclusions 

In this paper, we presented a discrete polyhedrization scheme of a given finite 
subset V C Z^. Because we were initiated into the polyhedrization problem from 
the border tracking problem, we gave an overview of the border tracking prob- 
lem in 3-dimensional digital images and clarified the difficulty of the problem 
caused by the discontinuous of the space. We then showed the proper reasons 
for our approach based on algebraic topology in which we mainly considered 
combinatorial topology. Because discrete polyhedra are defined as the boundary 
of 3-dimensional cellular complexes, we first defined discrete cells of the dimen- 
sions from 0 to 3 and discrete complexes which is a cellular complex consisting 
of discrete cells. Due to such algebraic topological definition, we have the case 
such that a discrete polyhedron may be non-manifold as shown in Figure 1. 

Another interesting point of our approach is that we interpret a digitized 
three-dimensional object V as a cellular complex C which may not be pure and 
may not have three dimensions. Therefore, we may have the dimension reductions 
because of the object digitization or the limited resolution of digital images. Due 
to such interpretation, we can give the theoretical reason to ignore some lattice 
points in the border Br(V) for a discrete polyhedron dO and succeeded to make 
the comparison between Br(V) and the skeleton of 90. 

For the construction of a given finite subset V of Z^, we presented a similar 
look-up table to the marching cubes [10]. Therefore, we do not have to go through 
all theoretical steps such as a 3-dimensional cellular decomposition C of V, 
making the complex C pure for a discrete object O and obtaining the boundary 
do of the discrete object O; we directly obtain a discrete polyhedron 90 from 
a given finite subset V C Z^. 
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Abstract. In this paper we are interested in problems related to the 
encoding of partitions of discrete sets in R" with the restriction to the 
partitions which can be realized by a number of hyperplanes. 

A particular attention is given to the partitions of subsets of a planar 
integer grid of a given size, which is a model for a binary picture of a 
given size. 

An application of the used encoding method to the encoding of linear 
threshold functions is given, as well. It turns out, that the obtained 
0(n^) code is asymptotically optimal with respect to the required mem- 
ory space. 

Keywords; Linear partitions, coding, pattern recognition, threshold func- 
tions, discrete moments 



1 Introduction 

If A is a subset of a given set X and B equals the set difference X \ A then the 
ordered pair of sets (A, B) is a partition of X. If the partitioned set X is fixed 
we shall write A'’’ instead of X \ A. If X C R" and there exists a function of the 
form: f{xi,X2, ■ ■ ■ Xn) = ai ■ xi + Q2 ■ X2 + ■■■ an ■ Xn + b, such that 

( i) /(a:i,X2, . . . > 0 for (xi, X2, . . . , x„) € A and 

(ii) /(xi,X2, . . . ,x„) < 0 for (xi, X2, . . . , x„) e A^ = X \ A, 

then, we say that (A, A'^) is a linear partition of X. Obviously, in that 
case A and A'’’ belong to the different half-spaces determined by the hyperplane 
/(X1,X2, . . . ,x„) = 0 . 

If /(xi, X2, . . . , Xn) has the form: 

i—k 

/(xi, X2, ...Xn) = • Xi -b fli,2 • X2 + . . . + Qi^n ’ Xn + h) (I) 

i=l 

* The author is also member of the Mathematical institute of Serbian Academy of 
Sciences, under project 04M02 
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and (i) and (ii) hold, then the partition {A, is called k-linear partition of X . 
Geometrically speaking, the set X can be partitioned into A and A'^ by using k 
hyperplanes. 

If a set consists of m points it will be called m-point set. 

In this paper we are dealing only with partitions of finite sets, i.e., X is finite. 
Precisely, we are interested in the efficient encoding of such partitions in order 
to enable a fast comparison between them and to preserve their small storage 
complexity. For this purpose a knowledge about the number of partitions from 
the studied class is essential. It is not possible to give a general answer what is the 
number of fc-linear partitions, of a planar m-point set in n-dimensional space, 
as a function of k, m, and n. That is not possible even in the planar case (n = 2) 
by assuming that only one line (hyperplane) is used, i.e., k = 1. Namely, let a 
pair of points from a given set is said to be a minimal pair if there are no other 
points (from the same set) belonging to the straight line segment determined by 
this two points. Under this definition, the number of linear partitions of a planar 
set, equals the number of minimal pairs of the same set ([5]). Consequently, the 
number of linear partitions of a planar set with n points is somewhere between 
n— 1 and . Precisely, the maximum jg reached if there are no three 

collinear points (i.e., the points of the set are in so called general position). The 
minimum n — 1 is reached if all points are collinear. So, one can conclude, that 
the number of linear partitions of a given set strongly depends of the structure 
of the partitioned set, as it is already mentioned in [6] and [9]. 

Here, we give a coding scheme which can be applied to the partitions of 
an arbitrary discrete planar set by using of an arbitrary number of lines, but 
naturally, the encoding can not be efficient in all cases. We give aa detailed 
performance analysis in the case when the partitioned set is an r x r integer grid 
(Section 2). 

In Section 3, we give a coding scheme for the threshold functions defined on 
a fixed m-point subset of n-dimensional cube ({0,1}”). The coding requires an 
0{n ■ log m) amount of bits per coded function. If m = 2” that is an asymptotic 
minimum. 

The encoding is based on a use of discrete moments, which are already shown 
to be a useful tool in the area of digital objects encoding ([4], [13], [15]), as well. 

2 Encoding of Planar fc-Partitions 

In this section we give a coding scheme for the partitions (of a finite planar 
set) which can be realized with k lines. A set of discrete moments is used for 
such coding. Precisely, for a discrete set X, from R", its {pi,p 2 , . ■ . ,Pn)-discrete 
moment is defined as: 

{x\,X2,...,X-n)^X 

The order of the moment tipi,p 2 ,...,pn{^) is equal to pi -I- P 2 + ■ • ■ + Pn- 
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For the simplicity we start with n = 2, i.e., we study the encoding of fc-linear 
partitions of a planar set. 

Theorem 1 . Let {Ai,A\) and (^2,^2) he two partitions of a planar set X, 
obtained by k lines. Then, the partitions are equal, i.e., {Ai,A'i) = (^2,^2), if 
and only if 

t^p,q{Ai) = Hp^q{A2) for the nonnegative integers p, q, with p + q<k. 

Proof. If Ai = A2 then the corresponding discrete moments are equal obviously. 
What we have to prove is that the equalities of the corresponded moments of 
the order up to k preserve the equality Ai = A2. We prove it by a contradiction. 
Let 



XP • = tip,q{Al) = flp,g{A2) = ^ xP • 

(a:,y)G.4i (x,y)eA 2 

holds for the nonnegative integers p and q satisfying p + q < k. 

If Ai yf A 2 then we can assume Ai \ A 2 is nonempty, for an example (else we 
can start with the nonempty A 2 \Ti). Since (Ai,A1) is a fc-linear partition, then 
there exists a function /(x, y) of the form (1) which satisfies the conditions 
( i) and (ii). In the two-dimensional case it can be expressed as: 



f{x,y)= Y 

p-\-q<k 

for some real coefficients otp^q, with p + q < k. 
Further we have: 



o< E 

(x,y)GAi\A2 



E 

\p-\-q<k 







= E E otp.<i ■ 

p+q<k \{x,y)GAi\A2 



+ E 

p-\-q<k 



E 



. (x,y)eAinA2 




- E E 

p+q<k \ (x,j/)eAinA2 




= E 

p-\-q<k 






E 



- E 

p-\-q<k 




E 



(x,i/)eAinA2 



= E 

p+g<fc 



E 

(x,y)GA2 



xP-y« 



- E 

p-\-q<k 




E 

(x,y)GAinA2 



xP-y"^ 
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(x,y)GA2\Ai \p+q<k J 

The contradiction 0 < 0 finishes the proof. [] 

The extension of the previous theorem to the R" case, with n > 2, can be made 
in a similar manner. We give the statement without a proof. 

Theorem 2. Let (Ai,Ai) and (T 2 ,T|) be two k-linear partitions of an n- 
dimensional set X. Then, the partitions {Ai,A'i) and (^ 2 ,^ 2 ) are equal, if and 
only if 

Mpi,P2,...,Pn(^l) = Mpi,P2,....Pn (^ 2 ) 

for any choice of nonnegative integers pi,p 2 , ■ ■ ■ ,Pn with Pi+P 2 + - ■ ■+Pn < k. 

The previous theorem gives an answer how many discrete moments are 
enough for a unique identification of a fc-linear partition of a finite discrete set. 
What is important for us is: 

How many bits are necessary for a storage of the suggested set of moments? 

Obviously, a precise and general answer can not be given as a function of cardi- 
nality of the partitioned set and, the number of the lines used. It also depends 
on the “topology” of the set as well as on the “nature” of the points from the 
set (are they integers, or irrational numbers, how large they are, etc. ). 

Here we give a performance analysis of the proposed coding scheme assuming 
that the partitioned set is a subset of a two-dimensional integer grid of a given 
size, say r x r. 

Theorem 3. Order of magnitude of the number of bits for the coding of all 
k-linear partitions of an r x r integer grid is 

k^ • log r — • log k 



per coded partition. 

Proof. Since 

rr j,p+q-e2 

< JJ 

0<ai,y<r 

holds for any subset S of r x r -integer grid, it follows that the number of bits 
necessary for the storage of p.p^q{S) is upper bounded by 

[(p +9 + 2) -logr - log((p-h l)(g-f 1))1. 
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So, the total bit rate for the storage of all moments having the order less or 
equal to fc, can be upper bounded by: 




log 



p+q+2 



(p+ l)(g+ 1) 



k /k—p k—p \ 

p—O \q—0 q—0 / 



+ 6k^ + Ilk + 9 



logr - 2 • ^(fc -p+ 1) • log(p + 1) + C>(fc^) 

p—0 



+ 6/c^ + 11/c + 9 k 1 v^, , 

■ log r - 2 • ^ ln(p + 1) 



In 2 



P—0 



+2 -^p-log(p+l) + 



P—0 

k^ + 6k^ + life + 9 



fc + 1 

logr — 2 • - — — • / ln(a: + 1) dx 
In 2 Jp 



+ ■ f X ■ ln(a: + 1) dec + O(k^) 

In 2 do 



fc^ + 6fe^ + life + 9 



logr — 2 



fc^ • In fc — fc^ 



In 2 



2 / fc^ • In fc fc^ 



ln2 V 2 4 

fc^ + 6fc^ + life + 9 



C>(fc2) 

fc^ • In fc 



log’’ ) 



3 ° In 2 

which completes the proof. [] 



We want to mention that the above discussion is related to the partitions of 
an r X r-integer grid by fc lines which are in an arbitrary position. 

An upper bound for the number of partitions of r x r grid which are made 
with parallel lines can be found in [7]. A fast algorithm for counting of such 
partitions is described in the same paper. 

If the partitions are made by lines parallel to the y-axis then the number of 
moments used can be reduced. This particular situation is studied in [13]. An 
alternative for such a coding is studied in [2]. 
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A quick analysis of Theorem 3 shows that in the case of small number of the 
used straight lines the coding is efficient - by the way, if k is assumed to be a 
constant the coding scheme is optimal. This will be illustrated in the rest of this 
section. 

A discrete analytical hyperplane H(ao,ai, . . . ,an,oj) in dimension n is 
defined ([1], [10]) by the double Diophantine inequality: 



where the coefficients ao,ai,...an are integers with gcd{ao,ai, . . . ,an) = 
while w is a positive integer. Of course, xi, X2, . . . , are integers too. 

We give the following theorem showing that the encoding proposed here 
enables an optimal encoding of discrete analytical line segments. 

Theorem 4. The set of two-dimensional discrete analytical hyperplane seg- 
ments which can be inscribed into an rxr integer grid can be coded by 0{logr) 
bits per coded segment. 

That is an asymptotically optimal bit rate. 

Proof. The proof is a direct consequence of Theorem 1, and Theorem 3. Namely, 
any discrete analytical line (two dimensional discrete analytical hyperplane) 
H{ao, oi, a 2 ,Lo) can be “separated” by a function of the form 

/(x, y) = (oo + Qfi • X + «2 • 2 / - e) • (ao + ai • X + 02 • 2 / + w) 

( £T is a positive small enough real number) from the rest of r x r-integer grid. 
So, the moments 



are sufficient for the coding. 

The number of required bits for the storage is upper bounded by O(logr) 
because any of the moments used is less than r'* . [] 

3 Threshold Functions Encoding 

A switching function /(xi, X 2 , . . . , Xn) of n binary variables is defined by 
assigning either 0 or 1 to each of the 2" points (xi, X 2 , . . . , x„) S {0, 1}". 
A switching function of n variables is linearly separable if there exists a hyper- 
plane 7T in n-dimensional space, which strictly separates the “on” set /“^(l) 
from the “off” set /“^(O) . In other words, /“^(l) lies on one side of tt 
and /“^(O) on the other, and tt n {0, 1}" is empty. Linearly separable func- 
tions are also called threshold functions. A threshold function simulates a neuron 
examining its input and making its decision as to its next state. 

The following theorem shows how the set of threshold functions restricted to 
an arbitrary subset of {0, 1}" can be coded by using n integers instead of a 
using n -\- 1, so called, Chow parameters ([11])- 



n 



(xi,X2, . . . ,X„) e iL(ao,ai, . . . ,a„,w) 




To.o, 2 ^ 1 , 0 : Mo.i, fJ- 2 , 0 , 2 * 1 , 1 , 2 * 0.2 
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Theorem 6 estimates the number of bits necessary for a coding of n- 
dimensional threshold functions defined on m-point subset of {0,1}" and proves 
that such a coding is optimal in an asymptotic sense if m = 2" . 

Theorem 5. Any threshold function of /(xi,...x„) defined on a fixed set 
S C (0, 1}" can he coded uniquely by: 

X! X! ^1’ X! X2, ^ Xn-l . 

f(xi,...,Xn) = l f(xi,...,Xn) = l f {xi . ,,Xn) = l f (xi . ,,Xn) = l 

Proof. Let a set S' C (0, 1}" be given. A threshold function /(xi,...,x„) 
defines a partition of S into the sets A and A° = S \ A , where (for example) 

A consists of points in which /(xi, X 2 , . . . , x„) = 1 , while the points from 

A° = S\A satisfy /(xi, cc 2 , . . . , x„) = 0. 

Also, by the definition, A and A° are separable by a hyperplane tti : 

+ 02 • + ••■ + cTn • a;„ + 7 i = 0. For instance, let us assume 



oi • + «2 • a ;2 + • ■ • + On • + 7 i > 0 iff (xi, X 2 , . . . , x„) G A 

(let us remind that (xi, X 2 , . . . , x„) G A /(xi, X 2 , . . . x„) = 1. ) 

Let us assume that there is another partition {B, B‘^) of S which is 
corresponded to another threshold function g(xi, X 2 , . . . , x„) , which has the 
same proposed code, i.e.. 



f(xi,...,Xn) = l g{xi,...,Xn) = l f {xi . ,Xn) = l 



g(xi,...,Xn) = l 



E 



X2 = 



E 



X2, 



E 



^n—l — 



E 



^n—l 



g{xi,...,Xn) — l f {xi . ,Xn) — l g{xi , . . . ,Xn) = l 

Then, we have to show that (A,A°) = (B,B‘^) holds. It will be proved by a 
contradiction. 

Since (?(xi, X 2 , . . . , x„) is another threshold function defined on S, there 
exist a hyperplane 7 T 2 : /3i • xi + /?2 • X 2 + . . . + /3n • x„ + 72 = 0 which separates 
B and S\B. 

Without loss of generality we can assume that o;„ and /3„ are both different 
from zero (because A and B consist of a finite number of points) . 

Then there exist a hyperplane of the form 



6i - Xi + 62 - X2 + . . . + 5 n-l ■ Xn-l +7=0 
which separate the set differences A \ B and B\ A. (A possibility is 5i = 



^ — hi. for i — ^ 

Let us assume: 


2,...,n- 1, and 1 = ^ 


_ 72_ \ 
C^n ' 






Si ■ Xi + 62 ■ X 2 + 


• • • + Sn-l ■ Xn-l + 7 > 0 


for 


(X1,X2, . . 


■ ) Xn) G A \ B, 


and 










5i ■ Xi + 82 ■ X 2 + 


■ • • + Sn-l • Xn-l + 7 < 0 


for 


(X1,X2, . . 


. , x„) G i? \ A. 
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Since f ^ g then A and B must be different. It implies that at least one 
of A \ B and B \ A is nonempty. 

Let us assume that A\B is nonempty (else we can start with the nonempty 
set B \ A). The rest of the proof can be derived on a similar manner as the 
proof of Theorem 1. Further, we have: 

0< ^ (5i • xi + . . . + (5„_i • x„_i + 7 ) 

{x\ ^...,Xn)^A\B 

= 5i- ^ Xi + ... ^ Xn-i + 7- X! ^ 

{xi,...,Xn)^A\B {xi,...,Xn)^A\B {xi,...,Xn)^A\B 

+(5i • x\ + . . . + 5n-i ■ Xn-i + 7 ■ 1 

{x\,...^Xn)^Ar\B {x\,...,Xn)^Ar\B {x\^...^Xn)^Ar\B 

-5i- ^ Xi - - (5„_1 • ^ Xn-l - 7- X! ^ 

{x\,...^Xn)^Ar\B {x\,...^XrL)^Ar\B {x\^...^Xn)^Ar\B 

= 5i- ^ Xi + ... + 5n-i ■ ^ Xn-i + 7 • X! ^ 

/(xi,...,a:„) = l /(a:i,...,a;„) = l /(xi ,. . .,x„) = l 

-5i • ^ Xi - ... - 5„_i • ^ X„_i - 7- ^ 1 

(ici,...,a:n)€SnA (ici,...,a:n)€5nA {x\^...^Xn)^Br\A 

= 5i- ^ xi + . . . + 5„_i • ^ x„_i + 7 • ^ 1 

g(xi,...,Xn) = l g{xi....,Xn) = l g{xi....,Xn) = l 

-5i- ^ Xi - ... -5n-l- ^ X „_1 - 7- ^ 1 

(ici,...,a:n)€SnA (ici,...,a:n)€SnA {x\^...^Xn,)^Br\A 

= 6i- ^ xi + ... +(5„_i- ^ x„_i +7- ^ 1 

{x\,...^Xn)^B {x\,...^Xn)^B {x\ . ,Xn)^B 

-5i- ^ Xi - ... -5n-i- ^ x„_i - 7- ^ 1 

(ici,...,a:n)€SnA (ici,...,a:n)€SnA (ici 

= (5l • ^ Xi + ... +(5„_1- ^ Xn-l + 7- X! ^ 

(ici,...,a:n)€S\z4. {xi,...,Xn)^B\A {xi,...,Xn)GB\A 



(5l • Xi + . . . + (5„_1 • Xn-l + 7 ) ^ 0) 

(xi,...,x„)gB\A 
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The obtained contradiction 0 < 0 implies that (A, A'^) must be equal to 
{B, B‘^). But, that leads to the functional equality /(xi, . . . , x„) = g{x \, . . . , x„), 
for (xi, . . . ,x„) e S. 0 

The coding scheme for the threshold functions defined on {0,1}", proposed 
by the previous theorem, is asymptotically optimal with respect to the usual 
storage complexity criteria. 

Theorem 6. The one-to-one correspondenee between the threshold funetions 
/(xi, X 2 , . . . , x„) defined on a fixed m-point set S C {0,1}" and the set 
of corresponded moments: 

X! X! ^1’ X! X 2 , ... , ^ x„_i, 

= l f(xi,...,Xn) = l /(xi , . . . ,X„ ) = 1 /(xi , . . . ,X„ ) = 1 

enables an encoding of such functions within an 0(n -logm) bit rate per coded 
function. 

If S = {0,1}" (i.e., m = 2"} the required O amount of memory is 

asymptotically optimal. 

Proof. The discrete moments up to the first order are used for the coding. Any 
of them is an integer belonging to {0, 1, 2, . . . , m}. So, any of them can be 
coded with at most [log m] bits, and consequently, the total bit rate is upper 
bounded by 0{n ■ logm). 

In the case of S' = {0, 1}", the storage complexity is O {nfi) . On the other 
side, if LTF{n) denote the number of linear threshold functions of n variables, 
then the following lover bound for the logarithm of LTF{n) is proved in [12] 

log LT F(n) > ■ \1 — , 

V logn 

It implies that the necessary bit rate has as the order of magnitude. Since 
this minimum is reached by our coding scheme, its optimality follows in the 
asymptotic sense. [] 

4 Comments and Conclusion 

Problems related to the encoding of the partitions of a discrete sets (in arbitrary 
dimensions) by a given number of hyperplanes are studied. The obtained re- 
sults can be of interest in different areas: pattern recognition, pattern matching, 
pattern classification, neural computing, image analysis, etc. 

The proposed coding scheme is shown to be very efficient in some cases. Then 
it preserves a fast transmission and a fast comparison between partitioned sets, 
as well as their efficient storage, with a usage of relatively small memory space. 

Let us mention that a problem of enumeration and estimation of the number 
of linear partitions is studied as well. For an illustration we refer to [3], [7], 
and [8]. 
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Abstract. The paper gives strong instability results for a basic recon- 
struction problem of discrete tomography, an area that is particularly 
motivated by demands from material sciences for the reconstruction of 
crystalline structures from images produced by quantitative high resolu- 
tion transmission electron microscopy. In particular, we show that even 
extremely small changes in the data may lead to entirely different solu- 
tions. We will also give some indication of how one can possibly handle 
the ill-posedness of the reconstruction problem in practice. 



1 Introduction 

The field of discrete tomography deals with the retrieval of information about 
discrete objects from typically noisy data. The given data describe (possibly 
weighted) incidences of the object with query sets. Typical query sets are lines, 
planes, or various kinds of windows. The field’s numerous applications range 
from scheduling problems, [GGPOO], questions of data security, [IJ94], tasks in 
image processing, [SG82] and many others, [Rys78], [FLRS91], to the question of 
the reconstruction of crystalline structures from few of their images under high 
resolution transmission electron microscopy, [SKB+93], [KSB+95], [GdVOl]. In 
the present paper we focus on the question of ill-posedness of the latter discrete 
inverse problem. 

The quantitative analysis of high resolution transmission electron microscopic 
images of a given crystal yields essentially the information how many atoms of 
the given object interact with sharply focused electron beams in a given viewing 
direction, [SKB+93], [KSB+95]; see also [GdVOl]. So, in principle, we are given 
the information how many atoms there are on each line parallel to a given small 
number of directions. To be more precise, let n S N, n > 2, let F be a finite 
subset of Z”, let S' be a line through the origin, and let A(S) denote the set 

* Supported in part by the German Federal Ministry of Education and Research Grant 
03-GR7TM1. 
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of all lines of Euclidean n-space E" that are parallel to S. Then the (discrete) 
X-ray of F parallel to S is the function 

XsF : A{S) ^ No = NU{0} 



defined by 

XsF(T) = |Enr| = ^ If (x), 

xGT 

for each T S -4(5'). In the following let iF" = {F : E C Z" A F is finite} and 
= {lin {m} : m € Z" \ {0}}. The elements of F" and £" are called lattice sets 
and lattice lines, respectively. Quite typically, we have some additional a priori 
information available. This can be modeled by considering suitable subsets Q of 
F". 

Given m different lattice lines 5i, . . . , Sm, central questions in discrete to- 
mography are as follows. What kind of information about a finite lattice set F 
can be retrieved from its X-ray images Xs^F, . . . ,Xs„,Fl How difficult is the 
reconstruction algorithmically? How sensitive is the task to data errors? 

The surveys [Gri97], [GdVOl] and the book [HK99] give an overview of the 
known results dealing with the first two questions and their relatives. (The sur- 
veys also include some further explanation for the restriction made here, give 
examples for relevant classes Q and describe the imaging process in more detail.) 

In the present paper we focus on the stability and instability of the recon- 
struction task. Of course, it is clear that small changes in the data can produce 
inconsistency. But this is not really a problem. We will, however, prove the fol- 
lowing two theorems which show that a small change in the data can lead to a 
dramatic change in the image, already in the plane. Here the data is given in 
terms of functions 

f I : A{Si) ^ No, * = !,..., TO 

with finite support, hence the difference of two data functions with respect to 
the same line 5 is a function / : .4(5) — > Z whose size is measured in terms of 
its .^i-norm 

ii/iii= E 

TgA(S) 

Theorem 1. Let m £ N, m > 3, Si,...,Sm £ C? and a G N. Then there 
exist F\,F 2 G with the following properties: 

Fi is uniquely determined by . . . ,Xs„,Fi; 

F 2 is uniquely determined by Xs^F 2 , ■ ■ ■ ,Xs„,F 2 ; 

TZi \\Xs,Fi-Xs,F2h = 2 ( to - 1 ); 

|5i| = IF 2 I > a; 

Fi n F 2 = 0. 



For the case of three directions Theorem 1 shows that even if the X-ray data 
coincide on all but four single lines the corresponding solutions may still be 
completely disjoint. The next result shows that this kind of instability persists 
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even if we weaken our measure of the ‘distance’ from F\ to F 2 by replacing 
\Fi A F 2 \ by the affinely invariant difference 

(5aff(-F'i, F 2 ) := min{|J^i A h{F 2 )\ : h is an affine transformation}. 

For practical purposes the special case of translations is particularly interesting. 
It refers to the situation that we regard an image the same no matter where we 
look at it. As a generalization of Theorem 1 we show the following instability 
even for the much weaker affinely invariant difference. 

Theorem 2. Let m € N with m > 3 let Si, , Sm G be m different lines, 
and let a G N. Then there exist F\,F 2 € with the following properties: 

Fi is uniquely determined by Xs.,Fi, . . . ,Xg^Fi; 

F 2 is uniquely determined by Xs„F 2 , . . . , A 5 ^F 2 / 

Eti \\Xs,Fi-Xs,F 2 \\i = 2(m- 1); 

= I-F 2 I > a; 

Saff{Fi^F2)>\Fi\. 

Our two main theorems will be proved in Section 2. Section 3 will contain a 
rather weak stability result and some remarks towards a possible regularization of 
the ill-posed discrete inverse problem with a view towards our prime application 
in semi-conductor industry. 

2 Proof of Theorems 1 and 2 

We will now prove our main theorems. In [KH98], [KH99] a construction is 
given showing that whenever m > 3 and Si, ... , Sm G there exist arbitrarily 
large irreducible switching components. Our construction can be regarded as a 
modification of a generalization of this result. In the following we are not aiming 
at smallest possible sets Fi , F 2 but try to give the most transparent construction 
that covers both, Theorem 1 and Theorem 2 simultaneously. 

Throughout this section, let m > 3 and Si, ... , Sm € be m different lattice 
lines. Let vi, . . . ,Vm G such that Si = lin {ui} for i = 1, . . . , m. We may as- 
sume without loss of generality that vi = (1, 0)^ and V 2 = (0, 1)^; see [GGP99]. 

2.1 Some Technical Lemmas on Polynomials 

Our construction given in Subsection 2.2 is based on two functionals. Rather 
than giving it in all possible generality we will use two specific polynomials p 
and q here, defined by 

p{t) = t^ + t^ + t^ and q{t) = + t"^. 

In addition, we use a parameter cj S N that will be fixed later. The functionals 
are then of the form 



/,^(x) = ujp{x) and 5 ^(x) = q{x/to). 



Let us begin with a simple remark. 
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Remark 3. For any w e N, 

(a) U^),gu.{co-N)cN; 

(b) fcj and guj are strictly increasing on [0,oo[ whence invertible; 

(c) fuj{t) > gZ^{t) for every t g]1,oo[. 

Next we prove auxilliary results on certain functional equations involving 
and gui- 

Lemma 1. Let a, /3, 'y,6,a,r G R such that a5 — fi'; ^ 0. 

(a) Then the equation 

at + (3fu;{t) + a = g^{'^t + Sf^{t) + r) 

holds for at most 30 different values oft. 

(b) The equation 

fu:{at + Pfu{t) + cr) = 7t + 6f^{t) + T 

holds for all t € M. if and only if a = 6 = 1 and /3 = 7 = ct = t = 0. 
Otherwise it holds for at most 25 different values oft. 

(c) The equation 

agu,{f) + f3t + a = gui{igu){t) + 5t + T) 

holds for all t & M. if and only if a = 6 = 1 and /3 = ^ = a = T = 0. 
Otherwise it holds for at most 36 different values oft. 

Proof. To prove assertion (a) just note that deg(( 7 ;j) = 6 > deg{f^) = 5 > 2 
implies that the equation cannot hold as an identity. Hence there are at most 
deg (fuj) ■ deg (g^j) many solutions. 

Similarly, if (b) holds identically, P = 0, and we have the equation 

fu;{at + cr) =jt + Sf^{t) + T. 

Taking the k th derivative on both sides for k = 2, 3,4, 5 yields the condition 

a>^fi>^\at + a)=Sfi>^\t). 

The equation for k = 5 yields = 6, which implies <5 = 1 if a = 1 . Suppose that 
a yf 1. By choosing to = cr/(l — a) yf 1/5 we obtain a® = «■* = S. For to = 1/5 it 
follows that f^f'"\to) = 0, but 7^ 0. Thus, a® = = <5. In any case, this 

implies a = = 1. Taking the fourth derivative shows now that cr = 0. Plugging 

t = 0 into the original equation yields r = 0, and finally 7 = 0. 

Of course, (c) follows similarly. □ 
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2.2 The Basic Construction 

Now let fc e N such that > a, where a is the positive integer of Theorem 1 

or Theorem 2. For the proof of Theorem 2 we may assume that a > We 

begin by constructing two lattice sets Um and Kn with \Um \ = |Kn| = > a 

that are tomographically equivalent with respect to S'!, , Sm i.e., they have the 
same X-rays parallel to Si, , Sm- 
Let ui >2, set 

Xi=gMM)), 

and define 

Ai-i-i = goj(/i.j(Ai)) for i = 2, . . . ,fc - 1. 

Remark 4. (a) The sequence is independent of the specific choice 

ofi^; 

(b) Ai,...,AfcGN 

(c) Ai < • •• < Afe. 



Now let 



and 

Observe that the points of Ck-i are all of the form {t, gif^{t))^ . 

The two sets Bk and Ck are tomographically equivalent with respect to Si 
and 5*2. The rest of the construction will depend on suitably chosen positive 
integers 03 , . . . , G N which will be fixed later. For each additional line in , 
the sizes of the sets are doubled. For j = 3, . . . , m we define 

t/2 = Bk V2 = Ck 

~ (^'-1 + % o ) ^3 ~ ^-1 (^ j-i + % o )- 

Clearly, we have the following properties. 

Remark 5. For each cu the integers 03 , ... , 9m can be chosen suitably large, so 
that Um n Vm = 07 = \Um\ = 2 ™“^ • k, and Um and Vm are tomographically 

equivalent with respect to Si, ... , Sm ■ 

The required sets Fi and F 2 are now constructed by removing a point of Um 
and Vm each. In fact, we choose points zq G Um and zi G Vm that lie on the 
same line parallel to Si and set 



Fl = Um\ {^O}, F2 = Vm\ {^l}. 
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Note that the construction shows that 



FinJ^2=0 and |Fi| = |F2|=2™-2.fc-l. 



Clearly, 



m 



Y^WXs.Fi - XsMli = 2(m - 1). 



Our theorems will follow if we can show that the integers u> and 9^, ... ,0m 
can be chosen in such a way that no other finite lattice set is tomographically 
equivalent to Um and Vm with respect to Si, ... , Sm, and Um and Vm are such 
that even affine transformations cannot place them ‘too closely’ on top of each 
other. 

In fact , from the former property which will be derived in Lemma 3 it follows 
that Fi is uniquely determined by its X-rays parallel to Si, ... , Sm- Indeed, if 
F G is tomographically equivalent to Fi, then Zq ^ F, and U {zq} is tomo- 
graphically equivalent to Um- Since FU{zo} 7^ Vm we must have FuIzq} = Um 
hence Fi = F. Analogously, F2 is uniquely determined by X$^F2, - - . ,X$^F2- 

2.3 Properties of U-m and V-m 

Now we derive some crucial properties of Um and Vm for appropriate choices 
of the parameters. First note that every set that is tomographically equivalent 
to Uj or, what is the same, to Vj is contained in the grid of Uj 



Lemma 2. The integer oj can he chosen in such a way that there is no line 
parallel to S3, ... , Sm which intersects G2 in more than one point. 

Proof. Suppose there are indices ii,i2,ii,j2 G with ii yf *2, ji J2, 

and V G {v3, . . . ,Vm}, and there exists a real p such that 



j 



G, =z^n f| U (x + 5,), 



i—1 x^Uj 



for j = 2 , . . . , m. Of course. 



G2 = {(A„/^(A,))^ : i,j = l,...,k}. 




By Remark 4 (a) this equation is equivalent to 
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where v = The rank of the 2x3 matrix is 2, hence its nullspace 

is of dimension 1. On the other hand, for any two different numbers oji,uj 2 
and arbitrary the vectors (1, wi, and (1, W 2 , 1 * 2 )^ are always linearly 

independent, whence two such vectors cannot be in the nullspace simultaneously. 
Taking all possible quadruples of indices * 1,120102 G {1, • ■ • i fc} leads then to 
the union of k^{k — 1)^ one-dimensional subspaces that we have to avoid. Hence 
among any set of/c^(fc — different values of w there must be one for which 

none of the vectors lie in any of the nullspaces. □ 

Note that the result of Lemma 2 is invariant with respect to translations 
of G 2 . In the following we assume that oj is fixed according to Lemma 2. The 
next lemma implies Theorem 1 already. 

Lemma 3. The integers 63 , ... , 6 m can be chosen large enough such that no 
other finite lattice set is tomographically equivalent to Um cmd Vm with respect 
to Si , ... , Sm . 

Proof. Since Gm is the grid of Um and Vm we clearly have Um U l^n C Gm- 
Setting = G 2 , it is also clear that we can choose 63, ... ,9m large enough 
such that 

G, C G(*) = U {9,v^ + 

for i = 3, . . . , m. For a formal proof see [GGP99, Proof of Lemma 3.1]. 

The assertion will now follow if we make sure that Gm = Um U Vm since the 
only two subsets of Um U Vm that are tomographically equivalent to Um or Vm are 
precisely these two sets. We will show that for each point g G G^"^^\{Um'SVm) the 
line g + S 3 does not intersect Um U Kn- So, let g G G^™) \ {Um U Vm) and suppose 
g + S3 intersects Um U l^n- Of course, Um U Vm consists of disjoint translates 
of U 2 U V 2 , with translation vectors of the form with Si, ... ,5m G 

{0, 1}. Let ti be such a translation vector with g G ti + G 2 . By our assumption 
there is another such translation vector ^2 with {g + S3) n (^2 + G 2 ) yf 0. By 
Lemma 2 no line parallel to S3 intersects ti + G 2 and ^2 + G 2 in more than one 
point each. Hence these points must be g and (^2 — ti) + g. But g ^ t\ + {U 2 U V 2 ) 
hence (O — G) + 3 ^ ^2 + (G 2 U p 2 ). This contradiction concludes the proof of 
the assertion. □ 

2.4 The Affinely Invariant Difference 

In order to prove Theorem 2 we need to study the power of affine transformations 
on F 2 to reduce the symmetric difference to Fi . The next result is a corollary to 
Lemma 1. 

Lemma 4. Let A G be nonsingular, let a G R^, and let h be the affine 

transformation defined by h{x) = Ax + a. Then 

(a) |Hfcn/i(Gfc)| <31; 

(b) \Bk n h{Bk)\ >26^h = id; 

(c) \Gk n h{Ck)\ > 39 h = id. 
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Proof. Since the one point in Ck \ Ck-i is of slightly different form than the 
points of Bk U Ck-i we always count it as ‘matched’. Then the assertion follows 
easily from Lemma 1. □ 

Lemma 4 shows already that the ‘overlay’ power of affine transformations is 
rather limited. In fact, suppose that A is not the unit matrix i.e., h is not just a 
translation. If we use again that Um^Vm consists of disjoint translates of U2UV2 
with translation vectors of the form with 5\, ... ,5m G {0, 1}, and of 

course that U2 = Bk and V2 = Ck then Lemma 4 implies that 



IMc^m u Kn) n (C 7 „ u Kn)| 

< 2™-2 . 2™-2 . {\h{U2) n C/2I + \h{U2) n F2I + \h{V2) n C/2I + \h{V2) n V2\) 

< 125 • < 125 • a • 2"® < < i|Fi|. 

So it remains to study the effect of translations. To do this appropriately we 
require one more property of the numbers 63, ... , 6m- It is most easily stated 
with the help of a geometric functional, the breadth by{S) of a planar point set S 
in a given direction v G ||z ;||2 = 1; it is defined by 

bv(S) = max — min 
x^S x^S 

We choose the integers 63,. .. ,9m so that 

\\6iViW2 > by^{Ui-i U Vi-i) for * = 3 , . . . ,m, 

where Vi = Vi/||fi|| 2 - Of course, this choice does not interfere with the previous 
requirements. 

Now let t be a translation vector. Suppose first that 
Vmt > bv„,{Um-l U Vm-l). 



This means on the one hand that t has the capability of moving a point of t/m-iU 
Vm-i into 9mVm + {Um-1 U Vm-l). But On the other hand 



Hence 



{Um-l U Vm-l) n + {Um-1 U Vm-l)) = 

>\Fi\ 



{UmUVm) A (t+{UmUVm)) 
which is the assertion. So we can assume that 
{UmUVm) A (t+{Um'JVm)) 

= (^{Um-l^Vm-l)r\ (t+ (Um-l^Vm-l))'^^ 

U ^(dmVm + {Um-1 U Vm-l)) H + dmVm + {Um-1 U • 



Hence by an induction argument we can reduce the assertion to Bk U Ck, and it 
suffices to show that |Hfc n (t + Cfe)| < This follows from Lemma 4, which 

concludes the proof of Theorem 2. 
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3 Dealing with Instability in Practice 

The approximation algorithms given in [GdVWOO] show that the ‘combinato- 
rial optimization part’ of the reconstruction problem of discrete tomography can 
be handled very efficiently in spite of its computational complexity. In particu- 
lar, [GdVWOO] considered the problem Best-Inner-Fit(5'i, . . . , Sm) [BIF] that 
accepts as input data functions /i, . . . , and its task is to find a set F € F" 
of maximal cardinality such that 

XsiF{T) < f,{T) for all T e and i = 1, . . . , m. 

The results show that in spite of the underlying NP-hardness some simple heuris- 
tics already yield good a priori approximation guarantees and behave surprisingly 
well in practice. In fact a dynamic greedy/postprocessing strategy actually pro- 
duced solutions with small absolute error, for instances with 250000 variables 
and point density of 50% an average of -l-only 1.07, 23.28, 64.58 atoms were 
missing for 3, 4, 5 directions, respectively. 

As we have seen the reconstruction task of discrete tomography is intrinsically 
instable. There is however the following (rather weak) stability result. 

Lemma 5. Let F\ € F" and let fi, , fm be given (generally noisy) data func- 
tions. Then there exists a finite lattice set F 2 with Xs^F 2 {T) < fi(T) for all 
T S -4(5') and i = 1, . . . , m such that 



|Fi AF 2 I <^|j/,-A5,Fi||i. 

Proof. F 2 is constructed from Fi by deleting at most Wfi~^SiFi\\i points 

in order to satisfy the constraints XsiF 2 {T) < fi{T) for all T G -4(5) and 
i = 1, . . . , m. □ 

Lemma 5 shows in particular that under data errors the cardinality of a 
solution behaves in a stable way. Also it shows that even with noisy data there 
is always a solution of [BIF] that is ‘close’ to the original object in terms of the 
symmetric difference. (Of course, it is not even clear how to recognize a ‘good’ 
solution if the original image is not known, let alone how to find one.) In view of 
Theorem 1 this means that it is generally not a good strategy in practice to try 
to satisfy the measured constraints as well as possible. In fact. Theorem I shows 
that the ‘perfect solution’ for the noisy data may be disjoint from the original 
image while by Lemma 5 a ‘nonperfect solution’ may be quite close. 

In any case, it is important to utilize any additional physical knowledge and 
experience available in order to actually produce solutions that are close to the 
original physical objects. For the task of quality control in semi-conductor in- 
dustry we can, e.g., utilize the fact that we know the ‘perfect’ object in advance. 
Suppose we are given a few images under high resolution transmission electron 
microscopy of parts of a silicon wafer that carries an etched pattern of some 
electrical circuit. Our instability results seem to suggest that there is not much 
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hope for a reasonably good reconstruction of the object from these measure- 
ments since all physical measurements are noisy. On the other hand, we do not 
need to reconstruct the object ‘from scratch’. We know what the object would 
look like if the production process had been exact and the measurements were 
correct. Hence it seems reasonable to measure the distance from the theoretical 
perfect chip. This means we are trying to reconstruct a finite lattice set approx- 
imately satisfying the (noisy) X-ray measurements so that its distance from the 
perfect template is as small as possible. If there does not exist a good enough 
approximation, we know that the chip is faulty. But if there exists a good enough 
approximation, all we know is that the data from high resolution transmission 
electron microscopy is not sufficient to rule out the possibility that the chip is 
correct. Let us look at the following simple example of two images Fi and F 2 
depicted in Figure 1. 



The corresponding grid G has the size 40 x 40; whence the reconstruction 
problem involves 1600 0- 1-variables. The sets F\ and F 2 are depicted as fat 
black points (•); the empty places are represented as small dots (•). Template F\ 
is regarded as proper, while template F 2 is faulty since the conductor path is 
blocked. Now we consider X-rays parallel to the lines Si = lin{(l,0)^}, S 2 = 
lin {(0, 1)^} and S 3 = lin {(1, 1)^}. Note that Fi and F 2 are uniquely determined 
by their X-rays parallel to these lines. 



However, as shown in Figure 2 there are finite lattice sets F{ and F 2 with 

-1 < Xs,Fi(T) - Xs,F{{T) < 1 for each T € i = l,2, 3, 

and -1 < Xs,F 2 iT) - Xs,F^{T) < 1 for each T € ^4(5',), f = 1, 2, 3 



(a) phantom Fi (b) phantom F 2 

Fig. 1 . Two lattice sets in a 40 x 40 grid 
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(a) Fi (b) Fi 

Fig. 2. Reconstructions of Fi and F 2 , respectively, based on data with error at 
most one on each line 



that do not share the same ‘conductor path structure’. Also, F{ and F2 look 
similar and it does not seem justified to distinguish a ‘proper chip’ from a ‘faulty’ 
one based on this reconstruction. But after all, the goal is only to see whether 
there is a chance to detect faulty chips even in the presence of considerable 
data errors. And in fact, if the error on each X-ray line is reasonably small such 
information can be derived. Let e € N, and suppose that for j = 1,2 the data 
functions f(, f 2 -, fi are measured with 

-e < Xs,Fj{T) - fi <t for each T G A(S',), i = 1, 2, 3, j = 1, 2. 

We pursue an approach motivated by Lemma 5, choose some 5 G N and deter- 
mine two sets F^ and that minimize the symmetric difference to F\ under 
the constraints that 



-5 < Xs,F^{T) - fi <S for each T G A(Si), i = 1, 2, 3, j = 1, 2. 

Clearly, if 5 > e, the set F^ coincides with the template Fi. Further, if 

(5-be< max \\Xs,Fi - Xs,F 2 \\oo, 
i=l,2,3 

then F^ ^ F\. In our specific example this is for instance the case when <5 = e = 3. 
The question is whether this trivial observation which is only based on the ‘local’ 
information of what happens on single X-ray lines can be substantially extended 
when all the given information is taken into account and if the error is regarded 
as random variable with some given underlying distribution. Also, what can be 
said for real-world data? These questions will require further theoretical investi- 
gations and an extended experimental study for the above and other conceivable 
regularization techniques. 
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Abstract. Nearly 40 years ago Hough showed how a point-to-line map- 
ping that takes collinear points into concurrent lines can be used to detect 
collinear sets of points, since such points give rise to peaks where the cor- 
responding lines intersect. Over the past 30 years many variations and 
generalizations of Hough’s idea have been proposed. Hough’s mapping 
was linear, but most or all of the mappings studied since then have been 
nonlinear, and take collinear points into concurrent curves rather than 
concurrent lines; little or no work has appeared in the pattern recognition 
literature on mappings that take points into lines. 

This paper deals with point-to-line mappings in the real projective plane. 
(We work in the projective plane to avoid the need to deal with special 
cases in which collinear points are mapped into parallel, rather than con- 
current, lines.) We review basic properties of linear point-to-point map- 
pings (collineations) and point-to-line mappings (correlations), and show 
that any one-to-one point-to-line mapping that takes collinear points into 
concurrent lines must in fact be linear. We describe ways in which the 
matrices of such mappings can be put into canonical form, and show 
that Hough’s mapping is only one of a large class of inequivalent map- 
pings. We show that any one-to-one point-to-line mapping that has an 
incidence-symmetry property must be linear and must have a symmetric 
matrix which has a diagonal canonical form. We establish useful geomet- 
ric properties of such mappings, especially in cases where their matrices 
dehne nonempty conics. 



1 Introduction 

In 1962 Hough [37] patented a method of detecting collinear sets of points in an 
image by applying a linear point-to-line mapping to the image. Such a mapping, 
which in projective geometry is called a correlation (see Section 3), takes collinear 
points (points that all lie on the same line) into concurrent lines (lines that all 
pass through the same point). Thus if the image contains n collinear points 
the mapping takes these points into n concurrent lines; hence if the mapping is 
additive it produces a peak of height n at the point where these lines intersect. 

Over the past 30 years many variations and generalizations of Hough’s idea 
have been proposed. These methods generally involve (not necessarily linear) 
mappings / from the plane into some space, such that if many points of the plane 
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belong to a locus of a given form, their images under / give rise to a peak in the 
space. Such mappings are called Hough transforms; for reviews of the literature 
on these transforms see [54,38,44]. Hough’s original mapping was linear, but as 
we will see in Section 2 the Hough transforms developed afterward were nonlinear 
and map collinear points into concurrent curves rather than concurrent lines. 
Thus since Hough’s original patent little or no work on point-to-line mappings 
has appeared in the pattern recognition literature. 

We will not attempt here to develop a general theory of nonlinear Hough 
transforms; we will restrict ourselves to (not necessarily linear) point-to-line 
mappings from the plane into itself. To avoid special cases in which collinear 
points map into parallel lines, we will work in the real projective plane. To make 
the paper more widely accessible, in Section 3 we will provide a brief introduction 
to the real projective plane and to point-to-point and point-to-line mappings of 
the projective plane into itself. A point-to-line mapping need not be linear even 
if it is one-to-one, but it can be shown that if it takes collinear points into 
concurrent lines it must be linear; thus any point-to-line Hough transform must 
in fact be linear. 

In Section 4 we will review some basic properties of linear point-to-line map- 
pings, with emphasis on point-line incidence properties of the mappings and on 
symmetry properties of their matrices. We will also discuss canonical forms for 
the matrices of such mappings and will show that Hough’s mapping is only one 
of a large class of inequivalent mappings. We will show that a (not necessarily 
linear) one-to-one point-to-line mapping that has an incidence-symmetry prop- 
erty must be linear and must have a symmetric matrix; such mappings have 
diagonal canonical matrices and simple geometric properties, particularly when 
their matrices define nonempty conics. 



2 Classical (Linear and Nonlinear) Hough Transforms 

In the point-to-line mapping used by Hough in his patent [37] the point (rt, v) 
was mapped into the line x = vy + u; evidently this mapping is linear. The 
mapping is given in [56] p. 123 in a slightly different form, as taking (u, v) into 
the line y = —ux+v; evidently this too is a linear mapping. (Canonical forms for 
these two versions of Hough’s mapping will be given in Section 4.2; interestingly, 
they are not identical.) Note that in the first mapping the coordinates of the 
point determine the x-intercept of the line and the reciprocal of its slope, and 
in the second mapping they determine the line’s y-intercept and the negative of 
its slope. 

The first paper to introduce the term “Hough transform” [23], published 
nearly 30 years ago, criticized Hough’s definition because it used a parameteri- 
zation of the line in which the parameters can become infinite. (The slope and 
y-intercept of a vertical line are infinite, and the reciprocal slope and x-intercept 
of a horizontal line are infinite; thus for both x = vy + u and y = —ux + v, 
both parameters are infinite either for vertical lines or for horizontal lines.) It 
was therefore proposed in [23] to use the so-called “normal form” for lines rather 
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than a slope-intercept form. In the normal form a line is parameterized by the 
slope angle 6 of its normal (which takes values in the bounded range [0, tt); the 
slope of the normal is tan 6) and its distance p from the origin (which is bounded 
for lines that intersect an image of bounded size) . In this parameterization the 
equation of the line is xcos0 -I- ysin6 = p; the point-to-line mapping used in [23] 
mapped the point (0, p) into this line. Evidently this is not a linear mapping; 
the coefficients of x and y are sinusoidal rather than linear functions of the slope 
angle 0. Also, this mapping usually does not take collinear points into concurrent 
lines; for example, if p is a nonzero constant the points (0,p) go into the lines 
that are tangent to a circle of radius p centered at the origin, and these lines are 
not concurrent. However, it can be shown that if the points Xi, yi in the xp-plane 
are collinear then the sinusoidal curves p = XiCOS0 + yiCOS0 in the 0p-plane are 
concurrent; thus this mapping takes collinear points into concurrent sinusoids 
which give rise to a peak in the 0p-plane where they intersect. 

Several other Hough transform definitions were proposed during the 1980’s; 
they too make use of nonlinear point-to-line mappings. In the transform defined 
in [66] a line is mapped into its intercepts with the sides of an upright square 
(e.g., the sides of the image); evidently these intercepts are of bounded sizes. Let 
the sides of the square lie on the positive x and y axes, let their length be s, 
and let the equation of the line be y = mx -|- &. If 0 < m < cxd and 0 < 6 < s 
this line has intercepts m and (s — h) /m with the left and upper sides of the 
square; the formulas for the intercepts are similar for other values of m and h such 
that the line intersects the square. Here again the mapping that takes the point 
(m, (s — b) /m) into the line y = mx+b is not linear; indeed b = —m[{s — b) /m]-|-s, 
so 6 is a linear function of the product of the intercepts. Similar remarks apply to 
the variant of this transform defined in [26]. The same is true for the transform 
defined in [21], which maps the line y = mx -I- b into the coordinates of the 
foot of the perpendicular from the origin onto that line. It is easily seen that 
the coordinates of this point are {bsm0cos0, bcos^0) where tan0 = m; here again 
the coefficients of the line are not linear functions of these parameters. Thus 
the classical Hough transforms [21,23,26,66] all involve nonlinear point-to-line 
mappings. 

Although these point-to-line mappings are nonlinear they are still one-to- 
one; a line is uniquely defined by the parameters of its normal form or by the 
coordinates of the foot of the perpendicular to it from the origin, and a line 
that crosses an image is uniquely defined by its intercepts with the sides of the 
image. In Section 3.3 we will show that if a (not necessarily linear) point-to-line 
mapping is one-to-one and takes collinear points into concurrent lines it must in 
fact be linear. 

If a point-to-line mapping takes collinear points into concurrent lines we can 
use it to detect sets of collinear points even if we discard the lines and keep only 
the “peaks” where sets of the lines intersect. Evidently even if the point-to-line 
mapping is one-to-one, the mapping that takes the points into the positions and 
heights of the peaks is many-to-one; many different point patterns that include 
sets of collinear points can give rise to the same set of peaks, since the height and 
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position of a peak depend only on the number of collinear points in a set and the 
equation of the line on which they lie. It can be shown [57] that if the points all lie 
on the sides of a convex polygon then the polygon is uniquely determined by the 
set of peaks, but this is not true if the polygon is not convex. (Actually, [57] did 
not deal with discrete sets of points on the sides of the polygon; it assumed that 
the points lying on a line segment give rise to a peak whose height is proportional 
to the length of the segment.) For further discussion of the non-uniqueness of the 
correspondence between patterns of points (or line segments) and the patterns 
of peaks in their Hough transforms see [43,39,1]. 

3 Linear Point-to-Point and Point-to-Line Mappings in 
the Real Projective Plane 

In this paper we study (not necessarily linear) point-to-line mappings from the 
projective plane into itself. In Section 3.1 we present a brief, informal introduc- 
tion to the real projective plane, and in Section 3.2 we introduce linear mappings 
of the projective plane into itself that take points into points ( “collineations” ) or 
points into lines ( “correlations” ) . It is known that if a one-to-one point-to-point 
mapping of the projective plane into itself preserves collinearity it must be lin- 
ear. In Section 3.3 we use this result to prove (Theorem 1) that if a one-to-one 
point-to-line mapping of the projective plane into itself takes collinear points 
into concurrent lines it must be linear. 



3.1 Points and Lines; The Projective Plane; Homogeneous 
Coordinates 

In the Euclidean plane a point can be specified by a pair of real-number co- 
ordinates (x,y), and a line can be specified as a set of points that satisfy a 
linear equation ax + by + c = 0 where the coefficients a, b, c are real numbers 
and a, b are not both zero. Any two distinct points are contained in a unique 
line; specifically, the line containing the points (r, s) and (m, v) has the equation 
{v — s)x — {u — r)y + (us — vr) = 0. Any two distinct lines intersect in at most 
one point, but parallel lines (i.e. lines ax + by + c = Q and dx + ey + f = Q for 
which the coefficients a and d, b and e are proportional) do not intersect. 

The relationship between points and lines can be made symmetric by ad- 
joining “points at infinity” to the plane and specifying that any two parallel 
lines intersect at such a point. This can be done without introducing infinite 
values for the coordinates of points. Instead, we introduce a homogeneous coor- 
dinate system in which a point is specified by a set of proportional triples of real 
numbers (i.e., if (r, s,t) and (u,v,w) are proportional we say that they specify 
the same point) and we require that the elements of a triple not all be zero. In 
this coordinate system, if t yf 0 the triple (r, s, t) represents the “finite” point 
(x,y) = (r/t,s/t); if t = 0,(r, s,t) represents a “point at infinity”. The set of 
points defined by triples in this way is called the real projective plane. 
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When we use homogeneous coordinates lines are still specified by linear equa- 
tions; a line is a set of triples {x,y^z), not all zero, that satisfy an equation of 
the form ax + by + cz = 0 where a,b,c are not all zero. Note that if (r,s,t) 
satisfies this equation so does any triple proportional to (r, s,f); thus a line is 
still a set of points. Moreover, if two triples of coefficients (a, b, c) and {d, e, /) 
are proportional they evidently define the same line; thus lines too are defined 
by triples of homogeneous “coordinates” (their coefficients). It is not hard to 
see that any two distinct (non-proportional) points are contained in a unique 
line and any two distinct lines (=lines whose coefficients are not proportional) 
contain (i.e., intersect in) a unique point. 

To summarize: In homogeneous coordinates both points and lines are de- 
fined by triples of real numbers (coordinates or coefficients) not all of which are 
zero, where proportional triples are regarded as defining the same point or line. 
Because of this symmetric relationship between points and lines the projective 
plane can be regarded as either a set of points or a set of lines, and if a statement 
about points is true (e.g., that the union of any two distinct points is contained in 
a unique line) then the “dual” statement about lines (e.g., that the intersection 
of any two distinct lines contains a unique point) is also true. 

3.2 Collineations and Correlations 

A linear mapping of the projective plane into itself is called a collineation. Be- 
cause of the symmetric relationship between points and lines such a mapping can 
be regarded as either taking points into points or lines into lines. The symme- 
try between points and lines also allows us to define linear mappings that take 
points into lines or vice versa; such a mapping is called a correlation. The terms 
“collineation” and “correlation” were introduced by Mobius in 1827. (Note that 
in signal processing and statistics the word “correlation” has rather different 
meanings.) Our main interest in this paper is in correlations, but we will occa- 
sionally make use of basic facts about collineations. Note that statements about 
correlations are not the “duals” of statements about collineations. 

In any given homogeneous coordinate system a correlation / : x — > u can 
be written in vector-matrix notation as u = Ax where A is a 3 x 3 matrix and 
X and u are 3x1 matrices (“column vectors”) representing the coordinates of 
X and the coefficients of u respectively. Evidently two matrices define the same 
correlation / (with respect to a given coordinate system) iff one is a nonzero 
scalar multiple of the other. If A defines / we should call it “a” matrix of /, but 
we will sometimes refer to it as “the” matrix of / if this will not cause confusion. 

Evidently a collineation or correlation is one-to-one (i.e. invertible) iff its 
matrix (more precisely: any of its matrices) is nonsingular. The invertible 
collineations evidently form a group under composition of functions. The invert- 
ible correlations cannot form a group since they are not closed under composition 
of functions, but together with the invertible collineations they do form a group. 

As an example, we saw in Section 2 that the point-to-line mapping used by 
Hough [37] mapped the point (u, v) into the line x = vy + u. In homogeneous 
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coordinates this mapping takes the point into the line [— Evi- 

/00-l\ 

dently this mapping is a correlation and has matrix 0 1 0 . Note that this 

matrix has determinant 1, so the mapping is invertible. 

Collineations and correlations are treated in detail in books on (projective) 
geometry such as [5,12,13,14,15,16,18,19,20,28,29,30,33,35,36,42,45,46,48,55,59] 
[60,61,62,64,65,67]; treatments of the subject intended for use by computer vision 
researchers can be found in [31,40,41,52]. Correlations, in particular, are treated 
in [5] pp. 186-187; [12] pp.58, 63; [13] pp.30-31, 74; [14] pp.254-259, 262-263; [15] 
pp.90-93; [16] pp.55-61; [17] pp.179-188; [18] pp.51-53, 85; [19] pp.165-171; [20] 
pp.49, 57, 98; [27] pp.265-270; [28] pp.339-342; [29] pp.l68, 326-334; [30] pp.79, 
102; [31] p.39; [35] Ch.9, pp.362-429; [40] p.28 (see also [41]); [42] pp.234-235; [45] 
pp.229-235; [48] pp.l36, 268; [52] p.483; [55] pp.l98, 280, 288; [59] pp.224-230; 
[60] p.171; [61] pp.89-90; [62] p.24; [64] pp.262-267, 278-280; [65] pp.77-86; [67] 
pp. 88-94. 



3.3 A Point-to-Line Hough Transform Must Be Linear 

Let / be a mapping (not necessarily linear) defined on the real projective plane 
that takes points into lines. Even if / is a one-to-one correspondence and hence 
is invertible, it need not be linear. For example, if / takes the point (m, v, w) 
into the line whose coefficients are {vw,wu,uv) (see [46] p.40, and [62] pp. 74- 
76, where this mapping is called a polarity with respect to a triangle) it is 
invertible but not linear; this is the simplest nontrivial example of an invertible 
Cremona transformation (see [59] p.231). The same is true if / takes (u,v,w) 
into (m^, since on the real projective plane this mapping too is invertible. 

However, as we shall now show, if / is a one-to-one point-to-line correspondence 
that takes collinear points into concurrent lines it must be linear. 

It is known that a one-to-one point-to-point correspondence preserves 
collinearity iff it is linear; dually, the same is true for a one-to-one line-to-line 
correspondence that preserves concurrence. Proofs of this theorem can be found 
in many books on projective geometry; for examples see [4] p.88 (where it is 
called the Fundamental Theorem of Projective Geometry), [12] p.57, [29] p.l09, 
and [45] p.l52. Interestingly, none of these books mentions the corresponding 
result about point-to-line correspondences (which is NOT the dual of the point- 
to-point or line-to-line result); we therefore prove it here using an argument 
based on [18] p.85. 

Note first that if / is a one-to-one point-to-line correspondence is a one- 
to-one line-to-point correspondence, and if / and g are one-to-one point-to-line 
correspondences f~^ o ^ is a one-to-one point-to-point correspondence. Let /o 
be any specific linear one-to-one point-to-line correspondence — for example, the 
“identity” mapping that takes the point {u,v,w) into the line whose coefficients 
are (m, v, w); note that this particular /o is defined by the 3x3 identity matrix I. 
Then for any one-to-one point-to-line correspondence / we have / = fo°ifo^°f)-> 
where /(j”^ o / is a one-to-one point-to-point correspondence. Evidently a linear 



Point-to-Line Mappings and Hough Transforms 



193 



point-to-line correspondence maps collinear points into concurrent lines and vice 
versa; hence if / maps collinear points into concurrent lines o f preserves 
collinearity. Hence by the theorem stated in the preceding paragraph /(7^ ° / is 
linear. Since /o is linear / = /q o o /) must also be linear. We have thus 
proved 

Theorem 1. A one-to-one point-to-line correspondence takes collinear points 
into concurrent lines iff it is linear. 

Since a point-to-line Hough transform must take collinear points into concurrent 
lines, we have thus proved that any such transform must be linear. As we saw in 
Section 2 the “classical” nonlinear Hough transforms take collinear points into 
concurrent curves, not into concurrent lines. 

4 Linear and Non-linear Point-to-Line Mappings 

In Section 4.1 we review some basic properties of linear point-to-line mappings 
(correlations), with emphasis on point-line incidence properties of the mappings 
and symmetry properties of their matrices; in particular, we discuss the relation- 
ship between polarities (correlations whose matrices are symmetric) and conics. 
In Section 4.2 we discuss how to define canonical forms for the matrices of corre- 
lations on the real projective plane. In Section 4.3 we study point-line incidence 
properties of point-to-line mappings that are not necessarily linear, and we prove 
(Theorem 3) that a one-to-one point-to-line mapping that satisfies an incidence- 
symmetry property must in fact be linear and must have a symmetric matrix. 

4.1 Linear Correlations 

Let / be an invertible linear point-to-line mapping, say / : x ^ u = Ax. The 
dual of / is the line-to-point mapping g : u ^ x = (A^^)^u , where T denotes 
matrix transpose. Note that g is not the same as the inverse of / unless A (and 
A^^) are symmetric. We begin this section by observing that / and its dual 
have a useful “incidence-preservation” property (see also Section 4.3): 

Proposition 1. A point P lies on a line L iff the point g{L) lies on the line 

HP)- 

Proof. Let P and L have homogeneous coordinates x = [xo,a;i,X 2 ]^ and u = 
[■uo, ui, ^ 2 ]^ respectively and let A be the matrix of /. Thus f{P) = Ax and 
g{L) = (A^i)^u. Since P lies on L we have u^x = 0; taking the transpose gives 
x^n = 0. This can be rewritten as x^[A^(A^^)^]u = 0, i.e. (Ax)^[(A~^)^u] 
= 0, which implies that g(L) lies on f{P). Conversely (Ax)^[(A^^)^u] = x^[A^ 
(A^^)"^]u = x^n; hence if the left side is 0 so is the right side, and taking the 
transpose gives u^x = 0. □ 

Our next two propositions are concerned with situations in which P lies on 
/(P) where / is not necessarily invertible. 
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Proposition 2. The point P = x lies on the line f{P) iffx satisfies the homo- 
geneous quadratic equation x^A^x = 0, where A is the matrix of f . 

Proof P = X and /(P) = Ax; hence P lies on /(P) if and only if (Ax)^x = 0, 
i.e. x^A^x = 0. □ 

If / is invertible and g is the dual of / we can show similarly that the line 
L = u passes through the point g{L) iff u satisfies the homogeneous quadratic 
equation u^(A"^)^u = 0. Note that x^A^x is the transpose of x^Ax; thus 
the quadratic loci x^Ax = 0 and x^A^x = 0 are always the same. Hence the 
quadratic loci x^A^x = 0 and u^(A^^)^u = 0 are the same iff A = (A~^)^, 
i.e. A^^ = A^, which means that A is orthogonal. 

If A = {oij) where 0 < i < 2 and 0 < j < 2, x’^A’^x is aooa^o+anxf + 022 X 2 + 
(ooi + oio)xoxi + (oi 2 + 02 i)xiX 2 + (o 2 o + oo 2 )x 2 Xo. If A is skew-symmeti'ic the 
coefficients of this quadratic expression are all zero; thus x^A^x is identically 
zero for every x. Conversely, if the quadratic is identically zero, taking x = 
[1,0,0]^ shows that we must have ooo = 0; similarly, taking x = [0,1,0]^ and 
[0,0,1]^ shows that an = 022 = 0, and taking x = [1, 1, 0]^, [0, 1, 1]^ and 
[1,0, 1]^ shows that (ooi + oio) = (012 + 021 ) = (020 + 002 ) = 0. Thus if the 
quadratic is identically zero A is skew-symmetric. Finally, note that A is skew- 
symmetric iff any nonzero scalar multiple of A is skew-symmetric. We have thus 
proved 

Proposition 3. Every point P lies on the line f{P) iff the matrix of f is skew- 
symmetric. 

A correlation whose matrix is skew-symmetric is called a null- polarity. Such a 
correlation is not invertible because a 3 x 3 skew-symmetric matrix is always 
singular. A correlation whose matrix is symmetric is called a polarity, note that 
a polarity may or may not be invertible. 

Any matrix A is the sum of a symmetric matrix and a skew-symmetric 
matrix. Indeed, let B = i(A + A^) and C = ^(A — A^); then evidently B is 
symmetric, C is skew-symmetric, and A = B + C. Moreover, P = x satisfies 
x^A^x = 0 iff it satisfies x^B^x = 0, since x^C^x = 0 for all x. Thus 
if A is not skew-symmetric, P = x lies on /(P) iff x lies on the quadratic 
locus x^B^x = 0. A correlation / for which both B and C are nonsingular is 
sometimes called a simple correlation ([35] p.394). 

We next show that / and its transpose (i.e., the correlation whose matrix 
is the transpose of that of /) have a useful “incidence-symmetry” property (see 
also Section 4.3): 

Proposition 4. Let A be the matrix of f and let f* be the correlation defined 
by A^. If Q lies on f{P) then P lies on f*{Q). 



Proof Let P = x and Q = y. Then Q lies on /(P) implies (Ax)^y = 0. Taking 
transposes gives y'^(Ax) = 0, i.e. (A^y)^x = 0, which implies that P lies 
on r(Q). □ 
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If is a scalar multiple of A the proof of Proposition 4 shows that / itself 
has the “incidence-symmetry” property: if Q lies on f{P) then P lies on f{Q). 
But evidently A^ = A A iff = 1, i.e. A = ±1; thus we have 

Corollary If the matrix A of f is symmetric or skew- symmetric (i.e., if f 
is a polarity or a null- polarity) and Q lies on f{P) then P lies on f{Q). 

In the remainder of this section we discuss conics and their relationship to 
polarities. A conic is the set of all points x in the real projective plane that satisfy 
a homogeneous quadratic equation aooXQ-\-aiixf-i-a 22 X 2 + ‘ 2 aoiXoXi-i- 2 ai 2 XiX 2 -i- 
2020^2^0 = 0 where the a’s are not all zero. Note that since we are working in the 
real projective plane a conic may be empty — for example, if ago, an, 022 are all 
nonnegative and ooi = 012 = 020 = 0. If we let oio = ooi, 021 = 012 , ap 2 = 020 
and let A be the symmetric matrix ( 0 ^), 0 < i,j < 2, the equation becomes 
x^Ax = 0. A conic is called degenerate if A is singular. It can be shown that a 
conic is degenerate iff x^Ax can be written as the product of two not necessarily 
distinct linear factors; they are distinct if the rank of A is 2 and are the same 
if its rank is 1 (see [46] pp. 81-83). It follows that a degenerate conic cannot be 
empty. For a discussion of the classification of conics see, e.g., [31] pp. 39-41; [.59] 
p.103; [61] pp.122-125. 

If C is the conic x^Ax = 0 where A is symmetric, the correlation / : x — > 
u = Ax is called the polarity associated with C. The line u is called the polar 
of the point x with respect to C, and the point x is called the pole of the line 
u with respect to C. If C is nondegenerate, so A is nonsingular, the polarity 
is invertible and we can also define a line-to-point mapping g : u — > (A^^)^x. 
Since A (and therefore A~^) is symmetric this polar-pole mapping is both the 
inverse and the dual of the pole-polar mapping. 

We will now discuss the polarities associated with nondegenerate, nonempty 
conics, which we call C-polarities. For a discussion of the polarities associated 
with degenerate conics see [59] p.ll8. Exercise (i). 

Let (7 be a nondegenerate, nonempty conic. A line L can intersect C in at 
most two points. If it does not intersect C it is called an exterior line; if it 
intersects C in one point P it is called a tangent (more fully: the tangent to C 
at P)\ and if it intersects C in two points it is called a secant. These concepts 
are illustrated in Figure 1. 

For any point P at most two lines through P are tangent to C. If there are 
no such lines P is called an interior point of C; there is exactly one such line 
iff P is on C; and if there are two such lines P is called an exterior point of C. 

The polar (with respect to C) of an interior point is an exterior line; the 
polar of a point P on C is the tangent to C at P; and the polar of an exterior 
point P is the line QR, where the two tangents to C through P are tangent to C 
at Q and R. The line QR is called the “chord of contact” of the two tangents 
(“secant of contact” would be better term). Thus a point P lies on its polar 
iff P lies on C, and its polar is then the tangent to C at P. These concepts are 
illustrated in Figure 2. 
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Every point on an exterior line, and every point on a tangent except for the 
“point of contact” where the tangent intersects C, is an exterior point. A secant L 
contains both interior and exterior points; if L intersects C at Q and R, and S, T 
is a pair of points on L that are separated by Q,R (see Section 5.1), then one 
of S,T is interior and the other is exterior. For further details see [18] pp. 56, 
86-87; [19] pp. 83-84; [45] pp. 196-198; [46], pp. 74-75; [48] pp. 138-139; [59] p. 

no. 

If a Hough transform is defined by a C-polarity, the geometric relationships 
between points (poles) and their corresponding lines (polars) can be used to 
simplify the search for peaks in the transform. For example, if the Pi’s are 
collinear points that lie on a line exterior to C, the corresponding lines Li must 
intersect at a point interior to C. 

If the points W, X, V, Z are collinear, the pair of points IF, X is called har- 
monic with respect to the pair Y, Z if the cross-ratio 

\WY\/\YX\ 

\WZ\/\ZX\ 

= —1, where \UV\ is the signed length of the line segment UV. Two distinct 
points P, Q are called conjugate with respect to C if the line PQ intersects C in 
two points and P, Q are harmonic with respect to the two points of intersection. 
It can be shown that for any point P the locus of the points that are conjugate 
to P with respect to C is a line, and this line is the polar of P with respect to C 
(see [29] p.l07; [46] p.73; [59] p. 108-109). Thus P is conjugate to Q iff Q lies on 
the polar of P. 

4.2 Canonical Forms for the Matrices of Correlations 

In this section we discuss how the matrix of a correlation can be put into a 
“canonical form” — specifically, with respect to an equivalence relation on ma- 
trices called “congruence” . 

Let /, f be correlations and let A, A' be the matrices of /, /' respectively. We 
call / and f' (or A and A') congruent if there exists a nonsingular matrix D such 
that A' = D^AD. We can regard D as a matrix of a point-to-point collineation 
and as a matrix of a line-to-line collineation; the effect of A' on any point x 
is obtained by first applying D to x, then A to Dx, and finally to the line 
A(Dx). If A' is congruent to A the points that lie on their corresponding lines 
under A' are related to the points that lie on their corresponding lines under A 
by 

Proposition 5. Let A' = D^AD and let y = D~^x. If x lies on Ax then y 
lies on A'y. 

Proof: By Proposition 2 x lies on Ax iff x^A^x = 0 and y lies on A'y iff 
yTj^/Ty _ Q jf X lies on Ax we have y^A'^y = (D^^x)^A'^(D^^x) = 
x^(D~^)^(D^A^D)(D"^x) = x^A^x = 0; hence y lies on A'y. □ 
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Congruence of matrices is evidently an equivalence relation and congruent 
matrices have the same rank. Congruence also preserves the property of being a 
polarity or null-polarity. Indeed, if A, A' are congruent we have A' = D^AD for 
some nonsingular D; hence (A')^ = D^A^D, and if A is symmetric or skew- 
symmetric so is A'. It can be shown that a matrix is congruent to a diagonal 
matrix iff it is symmetric ([49] Theorem 6.4.1). We shall now discuss how a 
matrix can be put into a canonical form under congruence, beginning with the 
cases where A is symmetric or skew-symmetric. 

If A is symmetric and has rank r it can be shown (see, e.g., [24] Theo- 
rem 5.5.4; [35] Ch.9) that A is congruent to a diagonal matrix of the form 

/Ip o o\ 

O —Ir-p O where Ip and Ir-p are identity matrices of sizes p x p and 

\o o o J 

(r — p) X (r — p) respectively and the O’s are zero matrices of appropriate sizes. 
(These results hold for any nxn matrix A over the field of real numbers. Simpler 
results hold for matrices over the field of complex numbers.) The uniqueness of p 
was discovered by Sylvester in 1852; it is called Sylvester’s law of inertia. For 
example, if A is 3 x 3 and nonsingular, the only possible canonical forms are I3, 

, and ^ Q ^ because it is a scalar multiple 

Two conics x^Ax = 0 and x^A'x = 0 are called congruent if A and A' 
are congruent. Since the matrix of a conic is symmetric and the matrix of a 
nondegenerate conic is nonsingular, this gives us 

Proposition 6. A nondegenerate conic x^Ax = 0 is congruent to one of the 
following canonical forms: (i) Xq^ + xf + X 2 = 0, (ii) xf^ + x\ — x\ = Q, or (Hi) 

Xq^ — X3 — X2 = 0. 

Note that in the real projective plane the first of these canonical forms is an 
empty conic and the other two are nonempty. Thus congruence preserves the 
property of being a C-polarity, which is equivalent to having canonical form (ii) 
or (iii). 

Suppose next that A is skew-symmetric. It is well known that the rank of a 
skew-symmetric matrix must be even (e.g., [24] p. 236). Since in our case A is 
3 X 3 its rank must be 2. It can be shown (e.g., [24] Theorem 5.4.1; [35] Ch.9) 

/ 010 \ 

that A is congruent to the matrix — 1 0 0 . Thus all skew-symmetric matrices 

\ 0 0 0 / 

have the same canonical form, so that all null-polarities are congruent to each 
other. 

One way to obtain a canonical form for a general correlation is to express 
A as B -I- C where B is symmetric and C is skew-symmetric (see Section 4.1) 
and then to simultaneously reduce B and C to canonical forms using a suitable 
congruence transformation. This approach is described in detail in [35] p. 397, 
Theorem II for nxn matrices over an algebraically closed field. It applies to 



fh o 

Vo -ii 
of I3. 
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matrices over the field of complex numbers but not to matrices over the field of 
real numbers. 

The canonical form of an n x n matrix over a field that is not necessarily 
algebraically closed is treated in detail in [63] Chapter 7, Section 9. For a 3 x 3 
matrix over the field of real numbers all the possible canonical forms are 

/dll 0 0\ /dll 0 0\ /dii0 0\ 

I d2l C?22 ^ ) ’ I ^ ^ ^ ) 

\d3i ds2 ds3 J \d 31 d 32 Oy yd3i0 0 y 



where the d’s are real numbers. It is not hard to see that if the d’s are nonzero 
these canonical forms are not congruent to each other. They are the canonical 
forms of matrices that have rank 3 (nonsingular matrices), 2, and 1, respectively. 
Example: As we saw in Section 3.1 the correlation that was used in Hough’s 
'0 0 - 1 ' 

patent has matrix A = | 0 1 0 | . Since A has determinant 1 it is nonsingular, 



so its canonical form must be a matrix of the form 



dll 0 O' 

d21 d22 0 

, d3i ds2 ds3 



Let D be 



the nonsingular symmetric matrix 



1 0 0 




. It can be verified that D^AD 



2 1 0 I ; this matrix is the canonical form of A. 

2 2 

A is neither symmetric nor skew-symmetric, so Hough’s correlation is neither 
a polarity nor a null-polarity. If we decompose A into B and C as described in 



Section 4.1 we find that B = 



0 0 0\ 


/O 0 


-1 


0 10 and C = 


00 


0 


0 0 0/ 


1 10 


0 



note that both B 



and C are singular, so Hough’s correlation is not a simple correlation. 

The variant form of Hough’s correlation given in [-56] p.l23 takes the point 
[u,L!, 1]^ into the line ux + y — v = 0 which has coefficients [u, 1, — so it 



has matrix A' = 




. It can be verified that using D = 




transforms A' into the canonical form D^A'D = 




note that this 



is not the same as the canonical form of A. 

If we give the d’s (in the canonical form of a general nonsingular 3x3 matrix) 
values other than those in the canonical form of Hough’s correlation we obtain 
infinitely many incongruent correlations none of which are congruent to Hough’s 
correlation. For example, setting all the d’s equal to 1 gives the canonical form 
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110 . This nonsingiilar correlation is not a polarity since its matrix is not 

Viii; 

diagonal (see the paragraph following Proposition 5). 

Other types of transformations can also be used to put a matrix into canon- 
ical form. For example, the matrices A, A' are called similar if there exists a 
nonsingular matrix D such that A' = D~^AD. A canonical form for n x n ma- 
trices under similarity, where no assumption is made about the algebraic closure 
of the underlying field, is called the rational canonical form; it was first described 
by Frobenius (1849-1917). A simpler canonical form called the Jordan canonical 
form exists when the underlying field is algebraically closed. For details about 
both of these forms see, e.g., [10,32]. However, these canonical forms seem to be 
of less interest in our case; for example, if A, A' are similar and A is a polarity 
or null-polarity A' is not necessarily a polarity or null-polarity. (Interestingly, it 
can be shown that the two variants of Hough’s correlation discussed above have 
the same rational canonical form.) Note that if D is an orthogonal matrix we 
have = D^, so similarity and congruence are the same. Canonical forms for 
polarities in the case where D is orthogonal are studied in Section 1.8.2 of [31], 
but we will not consider them here. 

4.3 General Point-to-Line Mappings 

In this section we consider (not necessarily linear) point-to-line mappings that 
have the incidence-symmetry and incidence-preserving properties described in 
Section 4.1, and we discuss conditions under which these properties imply lin- 
earity. 

Let / be a point-to-line mapping defined on the (real) projective plane. We 
call / incidence- symmetric if Q G f{P) implies P G f{Q)- Similarly, if 5 is a 
mapping that takes lines into points we call g incidence-symmetric if g{L) S M 
implies g{M) G L. As we saw in the corollary to Proposition 4, if / is a linear 
correlation and its matrix is symmetric or skew-symmetric (i.e., / is a polarity or 
null-polarity) then / is incidence-symmetric; the analogous result evidently also 
holds for 5 . It is not hard to see that if / is incidence-symmetric iff its inverse is 
incidence symmetric. 

Proposition 7. If f is onto and incidence- symmetric it takes collinear points 
into concurrent lines. If g is onto and incidence- symmetric it takes concurrent 
lines into collinear points. 

Proof. For any line L we have L = f{P) for some P, and for any Q G L, f{Q) is 
a line through P. For any point P we have P = g{L) for some L, and for any M 
through P, g{M) is a point of L. □ 

From Theorem 1 and Proposition 7 we have 

Theorem 2. A one-to-one incidence- symmetric point-to-line correspondence 
must be linear. 
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Let / be one-to-one and incidence-symmetric. By Theorem 2 / must be 
linear; let A = (oij) be the matrix of / (in a given homogeneous coordinate 
system). Let P and Q be the points [xq, cci, ^2]^ and [yoj 2/1; 2/2]^ respectively. 
By incidence-symmetry Q G f{P) implies P G f{Q)- Now Q G f{P) means 
[xo,xi,a;2]A'^[?/o,2/i,y2]’^ = 0 ; taking transposes gives [yo, 2/i, 2/2]A[xo, xi, X2]^ = 
0 . By incidence-symmetry this implies P G f{Q), i.e. [yo, yi, y2] A^[xq, xi,X2]'^ = 
0 . Now [yo,yi,y2]A[a;o,xi,a;2]^ = xo(yoaoo + yiOoi -l-y2ao2) -l-xi(yoaio -b yion -|- 
2/2012) -ba:2(yoa2o + yi02i -b 1/2(122), and similarly for [yo, yi, y2]A'^[xo, xi, ^2]^. 
Let [xq, X2] = [ 1 , 0 , 0 ]; then by incidence-symmetry yoooo + yiooi -b y2O02 = 0 
implies yoOoo + yiaio + y2<i20 = 0 . If y2 = 0 and yi yf 0 the first equation can be 
rewritten as ooo 2 /o/ 2 /i = ~ooi and the second equation as oooyo/ 2 /i = ~oio; hence 
if the first equation implies the second we must have ooi = oiq. By assuming 
instead that yi = 0 and y2 0 we can similarly show that 020 = 002- Similarly, 
by letting [xo,xi,X2] = [ 0 , 1 , 0 ] or [ 0 , 0 , 1 ] we can prove that 012 = 021; thus A 
is symmetric, so / is a polarity. Since (as pointed out at the beginning of this 
section) a polarity must be incidence-symmetric, we have thus proved 



Theorem 3. A one-to-one point-to-line correspondence is incidence- symmetric 
iff it is an invertible polarity. 

We can now show that the converse of Proposition 7 is false. If / is a one- 
to-one point-to-line correspondence that takes collinear points into concurrent 
lines, by Theorem 1 it must be linear and its matrix must be nonsingular; but 
since the matrix needn’t be symmetric / needn’t be incidence-symmetric. 

Some useful properties of an invertible polarity / can be derived from the 
fact that / is one-to-one and incidence-symmetric: 

1 ) Let f{P) = L] then / is a one-to-one correspondence between the points 
of L and the lines through P. 

2 ) For all P yb Q we have /[/(P)n/(Q)] = PQ (the line containing the points P 

and Q). Proof: Since / is one-to-one, P Q implies /(P) f{Q)- Let 

/(P)n/(Q) = R; then R G /(P) and R G f{Q), so by incidence-symmetry P 
and Q are both on f{R); hence f{R) = PQ. [We have not assumed that 
P G f{Q) or Q G /(P). Let /(P) D PQ = Q' and /(Q) n PQ = P'; 
then f{P') = QR and f{Q') = PR. Also P', P, Q, Q' are collinear and 
Z(P'), /(P), /(Q)Z(Q') are concurrent (at R).] 

3 ) If P G /(P) and Q G /(Q) then R = /(P) n /(Q) ^ PQ ([ 19 ] p. 67 ; [ 20 ] 
pp. 60 - 61 ). Proof: Both P and R are on f{P), so if R were on PQ,f{P) 
would be the same as PQ. Similarly, both Q and R are on /(Q), so if R were 
on PQ,f{Q) would be the same as PQ. This would imply /(P) = /(Q), 
contradicting the fact that / is one-to-one. 

4) If P G /(P) then for any Q P such that R = /(P) n /(Q) yb P we have 
R ^ PQ. Proof: If P G /(P) and R G PQ, P and R would both be on /(P) 
and on PQ, so /(P) = PQ. Hence Q G /(P), so P G /(Q). Thus P and R 
would both be on /(Q) and on PQ, so /(Q) = PQ; hence /(Q) = /(P), 
contradicting the fact that / is one-to-one. 
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Let / be a mapping that takes points into lines and g a mapping that takes 
lines into points. We call the pair (/, g) incidence-preserving if P £ L implies 
g{L) £ f{P). As we saw in Proposition 1, if / is a linear correlation and g is its 
dual the pair {f,g) is incidence-preserving. 

If / and g are inverses of each other incidence-preservation is closely related 
to incidence-symmetry; indeed, we have 

Proposition 8. If f and g are inverses of each other, {f,g) is incidence-preserv- 
ing iff f or g is incidence- symmetric iff they are both incidence- symmetric. 

Proof Let f{P) = L. If (f,g) is incidence-preserving Q £ L = f{P) implies 
P = g{L) £ f{Q), so / is incidence-symmetric. Similarly, let g{L) = P; then P = 
g{L) £ M implies g{M) £ f{P) = L, so g is incidence-symmetric. Conversely, 
let / be incidence-symmetric and let f{P) = L. Then Q £ L = f{P) implies 
P = g{L) £ f{Q), so (/, 5) is incidence-preserving; similarly, if g is incidence- 
symmetric, (/, g) is incidence-preserving. □ 



We saw at the beginning of this section that if / and g are inverses of each other, 
/ is incidence-symmetric iff g is incidence-symmetric. Note also that by Theorem 
3 if / is one-to-one and incidence-symmetric it is an invertible polarity; hence 
its matrix is symmetric, so its inverse g is the same as its dual, which implies 
that (f,g) is incidence-preserving by Proposition 1. 

By Propositions 7 and 8, if / and g are inverses of each other and (/, 5) is 
incidence-preserving, / takes collinear points into concurrent lines and g takes 
concurrent lines into collinear points, and by Theorem 2 / and g must both be 
linear. If / and g are not inverses of each other they need not be linear even if 
(/, g) is incidence-preserving; but they still take collinear points into concurrent 
lines and vice versa, as we see from 

Proposition 9. If (f,g) is incidence-preserving f takes collinear points into 
concurrent lines and g takes concurrent lines into collinear points. 

Proof: Let P, Q, R be collinear, say on L; then g{L) must lie on 
f{P),f{Q),f{R), so these lines are concurrent. Let K,L,M be concurrent, say 
at P; then g{K), g{L), g{M) must lie on f{P), so these points are collinear. □ 



5 Concluding Remarks 

This paper has made many basic observations about Hough transforms and 
point-to-line mappings. We summarize a few of these observations here: 

1) A wide variety of Hough transforms that make use of (nonlinear) point- 
to-curve mappings have been defined. Linear point-to-line mappings like 
Hough’s original mapping are computationally simpler, but have been ne- 
glected. 
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2) A one-to-one point-to-line mapping that takes collinear points into concur- 
rent lines, and so can be used as a Hough transform, must in fact be linear 
(Theorem 1). 

3) A one-to-one point-to-line mapping that preserves point-line incidence must 
also be linear, and its matrix must be symmetric, so it must be a polar- 
ity (Theorem 3). Because of their incidence-symmetry property, invertible 
polarities are an especially well-behaved class of point-to-line mappings. In 
particular, if an invertible polarity (with respect to a nonempty conic) is 
used as a Hough transform, the geometrical relationships between points 
and their corresponding lines can be used to simplify the search for peaks in 
the transform. 

4) The matrix of an invertible polarity is congruent to a diagonal matrix whose 
diagonal elements are (1, 1, 1), (1, 1, — 1), or (1,— 1,— 1). In the latter two 
cases the polarity is a C-polarity, i.e. it defined by a nonempty conic. Hough’s 

. 

mapping had matrix 0 1 0 and was not a polarity; the C-polarity whose 

\10 0 / 



matrix is 




might have been a better choice. 



This paper could be expanded in many directions, as suggested in the fol- 
lowing paragraphs. 



a) We have considered only point-to-line mappings and their use for detecting 
sets of collinear points. As pointed out in Section 2 much of the work on 
Hough transforms [38,44,54] has dealt with point-to-curve mappings, and 
such mappings are also often used for detecting sets of points that lie on a 
curvilinear locus such as a circle or ellipse. It would be of interest to generalize 
the concepts in this paper to mappings that take points into curves and that 
take sets of points that lie on a curvilinear locus into sets of concurrent 
curves. 

b) We have considered only mappings defined on the plane. The Hough trans- 
form approach has also been used to detect sets of collinear points (or sets of 
points that lie on other types of loci — e.g. space curves, planes, or surfaces) 
in higher-dimensional spaces [2,11,50]. It would be of interest to general- 
ize the concepts in this paper to loci in space. Note that in n-dimensional 
space there is a dual relationship between points and hyperplanes, not be- 
tween points and lines. Note also that if n is even a null-polarity can be 
invertible because a skew-symmetric matrix need not be singular. Invertible 
null-polarities in projective 3D space (using 4-tuples of homogeneous coor- 
dinates) have geometrical relationships to “twisted cubics” analogous to the 
relationships between polarities and conics in the projective plane. For ref- 
erences on correlations in n-dimensional spaces see [29] pp. 422-424; [35] Ch. 
9; [47]; [53] pp. 373-377; [58] pp.99-105; [59] pp. 359-363. 
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c) It would be of interest to study the effects of point-to-line mappings on vari- 
ous types of planar loci — for example, on conics. The relationship between 
point-to-plane mappings and quadric surfaces might also be of interest. 

d) Point-to-line mappings and twisted cubics are useful for camera calibration; 
see [25], [31], [52]. It would be of interest to investigate other applications of 
the concepts studied in this paper to the interpretation of images of three- 
dimensional scenes. 

e) For convenience, in this paper we have dealt only with the real projective 
plane. For some purposes it might have been advantageous to work in other 
types of planes. For example, 

(1) We could have worked in the Euclidean plane, as Hough did in [37]. The 
study of points and lines in the Euclidean plane is more complicated be- 
cause of the need to deal with exceptional cases (e.g., parallel lines), but 
it is more faithful to the geometry of real-world images. For a discussion 
of Hough transforms in the Euclidean plane see [1]. 

(2) As a compromise between the Euclidean and projective planes, we could 
have worked in the affine plane (see also [1]). This would have allowed us 
to distinguish between different types of conics and between intersecting 
and parallel lines. 

(3) We could have worked in the complex projective plane. This would have 
required us to introduce imaginary points and lines, but would have 
allowed us to give more general treatments of such topics as conics (see 
Section 4.1) and canonical forms (see Section 4.2). Many of the results 
in this paper remain valid in the complex projective plane. “Circular 
points” in the complex projective plane, which are discussed in many 
books on projective geometry, play key roles in camera calibration and 
in the analysis of images of 3D scenes, as discussed in [25,31,52]. 

(4) Many of the results in this paper involve only finite sets of points and 
lines and could possibly have been formulated as results about finite 
projective planes. (A finite projective plane is finite set of “points” and 
a finite set of subsets of the points, called “lines”, such that each pair of 
distinct points are contained in a unique line, each pair of distinct lines 
intersect in a unique point, and there exist four points that are not all 
contained in the same line.) For treatments of finite projective planes 
see [3,7,8,22,34]. Note that a digital image contains only finite numbers 
of points and lines, but it is not a finite projective plane because digital 
lines can intersect in more than one point. 

(5) Incidence properties of correlations were a central theme of this paper. 
Such properties can be studied in an abstract framework; see [9] , which 
presents a general theory of incidence structures and defines correlations 
on such structures. It would be of interest to generalize the results of 
this paper to such structures. 
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Fig. 2. Points and their polar lines with respect to a conic C: a) The polar of an 
interior point is an exterior line, b) The polar of a point P on C is the tangent 
to C at P. c) The polar of an exterior point Q is the secant joining the points 
at which the two tangents from Q meet C 
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Abstract. Euclidean geometry on a computer is concerned with the 
translation of geometric concepts into a discrete world in order to cope 
with the requirements of representation of abstract geometry on a com- 
puter. The basic constructs of digital geometry are digital lines, digital 
line segments and digitally convex sets. The aim of this paper is to review 
some approaches for such digital objects. It is shown that digital objects 
share much of the properties of their continuous counterparts. Finally, it 
is demonstrated by means of a theorem due to Tietze (1929) that there 
are fundamental differences between continuous and discrete concepts. 

Keywords: digital geometry, digital lines, digital convexity, Tietzes’s 
theorem 



1 Introduction 

The concept of convexity plays an important role in mathematics and also in 
applications. Specifically in visual perception convex sets are of importance since 
almost all visible objects are either convex or else composed of a finite number of 
convex sets (the “convex ring”, see e.g. [19,27]). Therefore a considerable part of 
books on digital geometry is devoted to convexity (see e.g. [2, Chapitre 5] or [31, 
Chapter 4.3]). In shape recognition two dimensional sets can be decomposed into 
perceptually meaningful parts by considering convex and concave parts of their 
boundaries [16]. There exist numerous generalizations of convexity [18, Chapter 
6.2]. Ronse gave a bibliography on convexity which covers the years 1961 to 
1988 [24]. A very detailed history of the topic is given by Hiibler [10]. For a quite 
recent account of the subject including also detailed historical informations the 
reader is referred to Klette’s survey article [12]. 

Convexity is closely related to the geometry of lines. Convex sets are defined 
by means of line segments and on the other hand line segments are convex hulls 
of two-point sets. In an abstract setting a hyperplane is a nonempty convex set 
whose complement is also nonempty and convex [29, Part II]. 

The aim of this article is to give a review on digital lines and digital convexity. 

Most of the proofs are omitted in this paper. Only in cases where the proof 
is not easily available or if it involves some special constructions, a sketch of it 
is indicated. 
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2 The Digital Space 

Given a linear space X over the real numbers M, for example the ordinary 
d-dimensional real vector space. In X we consider a certain subset X/i, the 
‘digital space’. In the standard example X = R*^, we can take X^ = which is 
the set of all vectors whose components have integer values. Although it is also 
possible to treat a good deal of the theory in irregular digital sets, we concentrate 
here exclusively on the case 

The topic of digital geometry is to translate continuous concepts, i.e. concepts 
in X into the digital world in X^- There are basically two possibilities. The 
pragmatie way is to define a diseretization mapping X X/^ . A digital set Sa C 
Xa is said to have a certain property if there is a continuous set S C X having 
this property such that Sa is the image of S under the discretization mapping 
(for a related approach see [13]). Of course, one has to take care that such a 
definition is well-defined. One disadvantage of this approach is that it is not 
canonic, it depends on the discretization mapping used. 

The other possible approach is the axiomatic way. Here, suitable characteris- 
tic properties are translated into the digital setting. The main advantage of this 
approach is that it is mathematically more attractive than the pragmatic way 
and that it allows to derive properties of the digital objects in a rigorous abstract 
way. In some fortunate cases both approaches lead to the same concepts. 

3 Digital Lines 

For sake of simplicity we now concentrate on planar sets, i. e. we consider 
(R^,Z^). In digital (plane) geometry two metrics are popular. Let x = (^ 1 ,^ 2 ) 
and y = {r]i,r] 2 ) two vectors in R^. Then the 4-metric is defined as 

di{x,y) = 1^1 - ?7i| -k 16 - m\ 



and the 8-metric is 



da{x,y) = max(|6 - mV 16 ~ ^i|) ■ 

These metrics are named according to the number of Z^-neighbors of distance 1 
of a point in 1? . 

A digital set Sa 6 I? is termed cr-connected, tr G {4,8}, if there exist 
points xi,X 2 ,---,x„ such that Xi G Sa for all i and do-(xi, Xi+i) = 1 for i = 
l,2,---,n- 1. 

This concept of connectedness induces a ‘topology’ (more precisely: a graph 
structure, see e.g. [18] or [31] for details) on I?. We define a digital a-curve to 
be a (T-connected digital set Sa with the property that each point x G Sa has 
exactly two cr-neighbors in Sa with the possible exception of at most two points, 
the so-called end points of the curve, having exactly one neighbor in Sa- A finite 
curve without end points is a closed curve. 
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For any set S' C and a point a: € we define for a S {4, 8} 



da{x,S) = inf da{x,y). 
y&S 

Then two discretization mappings can be defined 

Z\.(S) = jxGZ^ I ctG{4,8}. 

A4 is sometimes referred to as grid-intersection-discretization. 

Lemma 1. If S C is connnected then A^(S) is a* -connected, a G {4,8}, 



* ^ f 4 if a =8, 

{ 8 if a =4. 

By means of the discretization mappings A^- we could define a digital line as 
the image of a ontinuous line under one of these mappings. However, the image 
of a line under Z\o- is not necessarily a digital cr-curve. If one wants to get digital 
lines which are also digital curves, the discretization mapping must be modified 
such that each point in is mapped to exactly one point in Z^. If we define 
the influence region of a point x = (i,j) € Z^ by 



Trcr(x) 



2/e 



da{x,y) < 



then the union of all 7T8(x), x G 1? covers the plane whereas for the union 
of all 7T4(a;) this is not true. Even more important, the influence regions have 
boundary points in common. This accounts for the fact that there exist points in 
R^ which are mapped on more than one point in Z^ by means of the discretization 
mappings A„ . In the next section we see how this latter situation can be remedied 
in the case cr = 4. 



4 The Chord Property 

The first systematic approach to define digital lines by means of a suitable dis- 
cretization mapping dates back to 1974 when Rosenfeld published his paper on 
digital straight line segments [26] . For the proofs of the theorems of this section 
the reader is referred to Rosenfeld ’s original paper or the paper of Ronse [25]. 
For X and y in R^ the (continuous) line segment joining x and y is 

[a;, y] = { 2 ; G R^ \ z = \x -\- {1 — X)y, 0 < A < 1} . 

A digital set Sa Q is said to possess the chord property whenever for any two 
points X, y in Sa and for any u G [x, y] there exists & z G Sa such that d^,{z, u) 
< 1. Sa has the strict chord property if the strict inequality sign holds. 

Lemma 2. A digital set Sa which has the strict chord property is 8-connected. 
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In accordance with the pragmatic approach to digital geometry we define: a 
digital (a-) line segment is a digital set which is a discretization of a line 
segment in 

Theorem 1. Each 4,-line segment has the chord property. 

Remark 1. It is generally not true that a 4-line segment has the strict chord 
property. 

We define the modified influence region (see figure 1) of a point x = {i, j) G 



Note that each point in belongs to at most one set x G Z, in 

contrast to the situation with influence regions tt ^^ x ). 

The modified grid-intersection-discretization then is given by the mapping 



Using this concept, we can formulate a stronger variant of theorem 1. 

Theorem 2. Any digital line segment generated by the modified 4,-discretization 
mapping Z \4 is an 8-curve having the strict chord property. 



by 




A'^(S) = {x G Z2 I n'^{x) n 5 ^ 0}. 




1 ^ 1 
2 2 



Fig. 1. The modified influence region 7T4 (x). Parts of the boundary indicated by 
thick lines belong to the influence region as well as the vertices marked • 



Digital Lines and Digital Convexity 213 



Remark 2. The assertion of theorem 2 is very appealing since it connects a cer- 
tain discretization method with a definition for digital lines which always leads 
to lines having the strict chord property and being 8-curves. The price we have 
to pay for this greater symmetry is lack of invariance. The mapping A'^ is not 
invariant with respect to 90° rotations of the plane. 

In this form digital lines were introduced by Rosenfeld [26]. Usually, if we 
refer in this paper to digital lines, then a zi^-line is meant. 

Corollary 1. Given a digital set such that for any two points x and y in Sa 
there is a digital A^- (A'^~) line segment containing x and y which is contained 
in Sa- 

Then Sa has the (strict) chord property. 

There is a converse of theorem 2: 

Theorem 3. Given a digital set Sa U 1? having the strict chord property. Let x 
and y he points in Sa- 

Then there exists a line segment 7 C K.^ such that A'^ipf) C Sa o,nd x,y G 

^'4(1)- 

The proof of this theorem needs Helly’s theorem [29, Part VI] from convexity 
theory [25,26]. 

Corollary 2. A digital 8-curve Sa having the strict chord property is a digital 
line segment (in the A'^^ sense). 

The theory sketched here has some advantages. Specifically, it translates the 
common concept of a line or a line segment, respectively, in a quite straightfor- 
ward way to the digital case via a quite natural discretization mapping. All con- 
structs used here can be generalized to higher dimensions [22,23]. However, it is 
not easily possible to verify the chord property for a given digital set. There exist 
different approaches for finding a characterization of digital lines and line seg- 
ments which makes exclusively use of Z'^-concepts. Moreover, these approaches 
lead to algorithms with linear time-complexity. The two main approaches in 
this direction are the syntactic characterization of Freeman [7], Rosenfeld [26] 
and Hiibler, Klette and Voss [11] and the arithmetic characterization of Debled- 
Rennesson and Reveilles [4] . There slso exists a very interesting number-theoretic 
approach by Voss [30] and Bruckstein [1]. This latter approach, however, will not 
be treated here. 

5 Syntactic Characterization of Digital Line Segments 

The chain code for coding digital 8-curves was proposed by Freeman [6]. 
The 8-neighbors of a point x = {i,j) G Z^ are numbered according to the 
following scheme. 
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A digital 8 -curve can be described by means of a simple compact linearly 
ordered data stucture containing the coordinates of one of its end points (or of a 
fixed point on it if a closed curve is treated) and a sequence of code numbers in 
{0, 1,2,3, 4, 5,6,7} indicating for each point on the curve which of its neighbors 
will be the next point on the curve. 

The following theorem was proved by Rosenfeld [26]. 

Theorem 4. Given a digital 8-curve consisting of more than three points which 
has the strict chord property. Let oi, 02 , • • • , a„ he the set of chain code numbers 
of the curve, 0 < < 7. Then 

1. There are at most two different code numbers among the ai, 

2. If there are two different code numbers among the at, say a and (3, then these 
belong to adjacent directions, i.e. a = /3 ± 1 (mod 8 ), 

3. If there are two different code numbers, one of them is singular, i.e. it never 
occurs twice in succession. 

A digital 8 -curve having the properties of theorem 4 is not necessarily a 
digital line segment. 

For digital lines (with rational slope) the following theorem holds: 

Theorem 5. Let TGI? be a digital line. Then the following three assertions 
are equivalent: 

1. The chain code of T fulfills the conditions of theorem j and is periodic. 

2. There exists a line with rational slope so that T is the A 4 discretization of 
this line. 

3. All lines generating T by discretization have the same (rational) slope. 

For digital lines which are characterized as in theorem 5 the axiom of par- 
allelity is not necessarily true. There do exist pairs of parallel lines which are 
different but not disjoint. The intersection of such parallel lines is not necessarily 
a digital line. Moreover, the intersection of two lines can contain more than one 
point. 

Freeman [7] proposed as a heuristic criterion that the singular direction 
should be distributed as regularly as possible in the chain code. Hiibler, Klette 
and Voss [11] made this heuristic statement precise by giving a syntactic char- 
acterization of digital lines and digital line segments. Assume that we are given 
a sequence F = {aj}j^z of code numbers, 0 < Oj < 7. An element of this 
sequence is singular whenever ak-i and Ofe+i, otherwise Ofc is regu- 

lar. A regular piece of F" is a finite subsequence of maximal length of successive 
elements of F with mutually equal code numbers. 
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Let T = be the set of all sequences of integers (a^ € N for 

all fc) . We introduce a reduction operator R : T — > T by means of the following 
procedure: 

— Delete all singular elements in F which are adjacent to two regular pieces. 

— Replace all regular pieces of F by their lengths. 

— Leave all other elements of F unchanged. 

The sequence F is said to possess the Freeman property if 

Fi There exists a number a € N such that Oj S {a, a + 1} for all j. 

F 2 Whenever F contains both numbers a and a + 1 then at least one of them 
occurs only singularly. 

The sequence F is said to possess the HKV (Hiibler, Klette, Voss) property 
if each reduced sequence R^{F), fc = 0, 1, 2 • • • has the Freeman property. 

We now introduce a convexity concept. A digital set Sa V is convex in 
the sense of Minsky and Papert or MP-convex if for any two points x and y 
in Sa and z G [x, y] n Z^ then z & Sa [20]. 

Theorem 6. Any digital 8-curve which is unbounded in both directions and MP- 
convex has the PlKV-property. 

Proof. The proof of this theorem is rather long and technical but straightforward. 
We give only a sketch of it. First it is shown that the chain code of an MP- 
convex curve has the Freeman property. This can be done by straightforward 
enumerative discussion of all possible cases. 

Then it is shown that each reduced code sequence has the Freeman property. 
In order to do this it is first shown that the Freeman code of an MP-convex 
curve consists of only two different regular pieces of size k and /c -|- 1 . It is easily 
seen that one of these regular pieces occurs singularly. 

The converse of theorem 6 is also true [10]: 

Theorem 7. Any digital 8-curve which is unbounded in both directions and has 
the HKV-property is MP-convex. 

Proof. One basic trick for proving this theorem is due to Hiibler. Without loss 
of generality we can assume that the chain code F for the digital curve Sa has 
only code numbers 0 and 1. If the reduced sequence R{F) has elements k and 
fc -|- 1 and if fc -|- 1 is singular then the linear mapping of the (real) plane with 
matrix 

V 1 oj 

maps the vectors (1,0) and (1,1) on the elements (fc -I- 1,1) and (fc -I- 2,1), 
respectively. The mapping Sa MS a, if interpreted as a mapping from to 

R^, is invertible and M~^R{Sa) can be interpreted as the chain code of a digital 
curve. 

The remainder of the proof is rather straightforward. 



216 



Ulrich Eckhardt 



For digital line segments the HKV-property must be modified in order to cope 
with boundary effects. For sake of completeness we indicate this modification. 
For details the reader is referred to the original paper of Hiibler, Klette and 
Voss [11]. 

Denote by Tq the set of all finite sequences of integers. As above we define 
singular and regular elements in a sequence and also the Freeman property for 
sequences F ^ To. 

The modified reduction operator R' : To — > To is defined as follows: All 
maximal subsequences of successive regular elements of F which are between 
two singular elements are replaced by their lengths and all other elements are 
deleted. If F contains no singular elements, F is replaced by its length. 

For a sequence F G To with the Freeman property we denote by £{F) the 
number of regular elements preceeding the first singular element in F and by 
r{F) the number of regular elements following the last singular element in F. 
The modified HKV-property then looks as follows: 

HKVi All reduced sequences R'^{F), fc = 0, 1, 2, • • • have the Freeman property 
(or are empty). 

HKV2 If R'^{F), fc = 1, 2, • • •, consists of identical elements a or of two different 
elements a and a -I- 1 then i{R'^~^ {F)) < a -|- 1 and r{R'^~^{F)) < a -I- 1. 
HKV3 If R'^{F), k = I, 2, • • •, contains two elements a and a -I- 1 such that a 
(< a -I- 1) is nonsingular, then 

R'^{F) starts with a if (,{R'^~^{F)) = a+1 and 
R'^{F) ends with a if r{R'^~^{F)) = a -I- 1. 

The following theorem characterizes digital lines by the HKV-property. The 
theorem was first stated by Hiibler, Klette and Voss [11]. The first proof of the 
theorem was given by Wu [32] (see also [10]). 

Theorem 8. A (finite) 8-curve is a digital line segment if and only if it has 
the modified HKV-property. 

The relevance of the theorems of this section lies in the fact that the modified 
HKV-property can be verified by a syntactic parsing of the chain code of a given 
digital curve. This process obviously can be performed in time proportional to 
the length of the curve. 

6 Arithmetic Characterization of Digital Line Segments 

An alternative definition of a digital line is due to J.-P. Reveilles [4]. Given 
numbers a, 6 G Z with & yf 0 such that the greatest common divisor of a and b 
is 1 and a number /i g Z, then a digital line is the set 

T>{a, b, p.) = {{x,y) G \ p < ax — by < p-\- max(|a|, |6|)} . 



( 1 ) 



Digital Lines and Digital Convexity 217 



This definition is an immediate consequence of the modified grid-intersection- 
discretization of a real line. To a (nontrivial) digital line segment one can 
associate two lines in namely the line 

I a^-6?7 = /r} 



and 

77) G R^ I — &?7 = max ax — by}. 

(x,v)gSa 

The strip bounded by these two lines contains the digital line segment. 

If an 8-curve is to be tested for linearity, a system of Diophantine inequalities 
has to be solved, since for each point (x, y) on the curve an inequality of the 
form (1) yields a condition for the integer variables y, a, h. The solution of such 
a system is straigtforward and easy and can be done in time proportional to the 
length of the curve. 



7 Digital Lines and Translations 

Hiibler [8,9,10] proposed a theory of digital lines which is based on translations. 
First one can state: 

Observation The chain code of a digital line is periodic if and only if there 
exists a nontrivial translation of 1? which leaves the digital line fixed. 

This leads immediately to the following definition: Given a translation T : 
— > Z^. A digital line in the sense of Hiibler is any set of the form {xq -I- 
T"x I n G Z} with fixed vectors xq,x G Z^. We assume that the translation is 
irreducible which means that there is no other translation T' : W? — > 1? such 
that T = (T')^ with fc > 1. 

By means of this concept one gets digital lines which are not necessarily 8- 
curves but have attractive properties. 

— For any two different points x and y in 1? there is exactly one digital line 
containing these points. 

— If two lines are parallel then they either coincide or else they are disjoint. 

— Two lines are parallel if and only if there is a translation mapping one of 
them into the other. 

— Two lines are parallel if and only if each translation leaving one of them 
fixed also leaves the other fixed. 

— Two lines which are not parallel intersect in at most one point. 

A digital line segment is a finite subset of a digital line containing consecutive 
points of the line. 

The main advantage of Hiibler ’s definition is its potential for generalizations 
to U^. 

Any digital A\-\me as defined above always contains a line in the sense of 
Hiibler if it is generated by a line having rational slope. 
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8 Digital Convexity 

There exist different definitions of digital convexity in the literature. We list 
some of the most common definitions. Let Sa be a digital set in (or Z"^, 
respectively) . 

The first definition of digital convexity was stated in 1969 by Minsky and 
Papert [20] (see above; MP-convexity) . The main reason for introducing this 
concept was to give an example of a predicate which has parallel order 3 (i.e. it 
can be verified by looking at triples of points in a digital set) and which is not 



Sa is H-convex if Sa = conv5'/i n Z'^. 

D-convexity Sa is digital convex or D-convex if for x,y G Sa the zi^-line 
segment joining x and y belongs to Sa- 

DH-convexity Sa is digital convex in the sense of Hiibler or DH-convex if for 
x,y G Sa the line segment (according to Hiibler) joining x and y belongs 
to Sa- 

Remark 3. If one considers irregular digital spaces Xa C Mf" then the definitions 
of MP- and H-convexity carry over without any difficulty. However, if no three 
different points of Xa are collinear, each set Sa Q Xa is MP-convex. On the 
other hand, D- and DH-convexity depend on a geometry of lines in Xa which is 
compatible with lines in 

In figure 2 examples are given for different convexity definitions. 

All these definitions have in common that the different convex sets have some 
of the properties which are known from ordinary convex sets. For example, the 
intersection of two MP- (H-, DH-) convex sets is always an MP- (H-, DH-) convex 
set. However, as can be seen by the conterexample in figure 3, the intersection 
of two D-convex sets is not always a D-convex set. 

Let Sa be a set which is convex according to one of the definitions above. 
An interior point of Sa is a point x G Sa such that the direct neighbors No{x), 
N2{x), N/^{x) and Nq{x) all belong to Sa- The interior of Sa is the set of all 
interior points. It can easily be shown that the interior of a convex digital set is 
also convex. 

We state some assertions on the mutual relations of these convexity concepts. 
Theorem 9. A digital set is DH-convex if and only if it is MP-convex. 



‘local’. 



MP-convexity For x and y in Sa and z G [x, y] n Z^ => z G Sa- 
H-convexity The convex hull of Sa is the set 




The proof of this theorem is obvious. 
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Fig. 2. Examples of convex sets. The digital set (•) in the left picture is MP- 
convex and H-convex but not 8-connected, hence not H-convex. The digital set 
in the right picture is MP-convex but not H-convex 




Fig. 3. Two D-convex sets whose intersection is not D-convex. The origin of the 
coordinate system is at the leftmost point •. The real line segment joining points 
(0,0) and (4,2) generates the digital Zi^-line segment consisting of all points • 
and *. Similarly, the line segment joining points (1,0) and (5,2) generates the 
digital line segment consisting of points o and *. The intersection of both digital 
line segments (points *) is not 8-connected, hence not D-convex 
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Theorem 10. A digital set has the chord property if and only if it is S-connected 
and H- convex. 

Theorem 11. Any MP-convex and S-connected digital set is D-convex. 

The proof of this theorem is very similar to the proof of theorem 6. 

The relations of all these convexity definitions are illustrated in Figure 4. It 
is an interesting fact that under the assumption of 8-connectedness all these dif- 
ferent concepts coincide. In the sequel we will always assume that the sets under 
consideration are 8-connected so that there is no need to distinguish different 
convexity concepts. 




Fig. 4. Relations of different convexity definitions. Double frames indicate con- 
vexity definitions which imply 8-connectivity. Bold arrows mean that the corre- 
sponding relation holds under the assumption of 8-connectivity. Numbers at the 
arrows denote the theorems where the corresponding assertions are formulated 
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9 Convexity Verification 

Given a finite set S' C with n elements, the convex hull of S can be determined 
in time O(nlogn) [21]. 

The situation is more favourable if convexity of a polygonal set P in 
is investigated. Such a set can be understood as an ordered set of boundary 
vertices such that each pair of two consecutive vertices is joined by a line segment. 
Convexity of a polygonal set in can be verified in a time which is proportional 
to the number of vertices of the polygon. By moving along the oriented boundary 
of the polygonal set in such a way that the interior of the set is always on the left- 
hand side, one determines for each vertex the direction change of the adjacent 
line segments. The set Sa is convex if and only if at each vertex these line 
segments form a left turn. 

Given a (bounded) digital set Sa & it is very easy to determine its 
boundary points (which are all points in Sa having at least one 4-neighbor 
not in Sa)- In the set of boundary points of a simply connected digital set 
(i.e. a digital set which is 8-connected and its complement is 4-connected) is a 
closed 8-curve. If consecutive points on the digital boundary curve are joined 
by line segments, a closed polygonal curve is obtained. Thus, to each (simply 
connected) digital set in Sa C a polygonal set P{Sa) can be associated in a 
canonical way such that = Z^ n P{Sa)- 

Unfortunately, if Sa is convex in Z^, P{Sa) is not necessarily convex in R^. 
It was noted by Hiibler, Klette and Voss [11] that for a digital set in Z^ convex- 
ity can be verified by traversing the boundary along digital line segments and 
monitoring the direction changes at each intersection point of two consecutive 
digital lines. This yields a linear time algorithm for convexity verification in W? 
(see also [14]). 

The approach of Debled-Rennesson and Reveilles [4] also leads to a linear 
time algorithm for detecting convexity. 

The situation is more complex in higher dimensions. First of all, the com- 
plexity assertions for constructing convex hulls in the continuous space are no 
longer true, in dimensions > 3 the picture is more complicated. Ronse [22,23] 
was able to generalize the chord property to arbitrary dimensions. However, for 
algorithmic purposes the chord property is not very helpful. Hiibler’s approach 
is independent of the dimension [10], however, it is also not immediately suitable 
for convexity detection. Debled-Rennesson [3] was able to carry over the main 
ideas of the approach proposed by her and by Reveilles [4] to three dimensions. 

10 Tietze’s Theorem 

In continuous convexity theory (see e.g. [29]) the boundary points of a convex set 
are classified by means of separation arguments. It is a remarkable fact that such 
arguments play virtually no role in digital convexity theory. Only in a more recent 
paper of Latecki and Rosenfeld digital supportedness was investigated [17]. 
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Assume that is equipped with Euclidean geometry, i. e. to each pair of 
vectors x = (Ci, ^ 2 , ■ • • , ^d) and y = (?7i, ?72, • ’ ’ Vd) the inner product 

d 

{x, y) = Y^ 

t=i 

is assigned. Given a nonzero vector x* G and a real number a, a hyperplane 
is a set 

[x* : a] = {x € R'^ I (x, x*) = a} . 

The following Characterization Theorem holds 

Theorem 12. A digital set Sa Q is digital convex if and only if for any 
point X gU^\ Sa there exists a hyperplane [x* : a] such that (x,x*) = a and 
(y,x*) > a for all y G Sa- 

This theorem is an obvious consequence of the Separation Theorem for Con- 
vex Sets [29]. A digital variant of the theorem may be found in [17]. A point of a 
set S C R"^ is termed an exposed point of S if there exists a hyperplane [x* : a] 
such that (x,x*) = a and (y,x*) > a for all y G S. Using this concept, the 
assertion of the following theorem is also known from convexity theory [29]; 

Theorem 13. If each boundary point of a digital set is an exposed point then 
the set is (digital) convex. 

In 1929 H. Tietze [28] proved a remarkable theorem. A set S' C R'^ is termed 
weakly supported locally if for each boundary point x of S there exists an (open) 
neighborhood C/(x) and a hyperplane [x* : a] such that (x,x*) = a and 

{y,x*) < a for y G U{x), y ^ X V i S. 

Tietze’s theorem now is [29, Theorem 4.10]. 

Theorem 14. Let S he an open connected set in R"^. If each point of the bound- 
ary of S is one at which S is weakly supported locally, then S is convex. 

The assertion of Tietze’s theorem means that in R'^ each set can be tested 
for convexity by looking only locally at boundary points. It is surprising that in 
Tietze’s theorem one needs topological assumptions (S is required to be open). 
It would be very attractive to translate this theorem into the digital world since 
it would allow to test any digital set for convexity in a time proportional to the 
number of its boundary points. Unfortunately, however, Tietze’s theorem does 
not hold in general in digital spaces. 

For a point x G denote by A/s(x) the set consisting of x together with 
its 8-neighbors. First we define: Given a set Sa C Z'’*. A point x G S'/i is a locally 
exposed point of Sa if there exists a hyperplane [x* : a] such that (x,x*) = a 
and {y,x*) > a for all y G {Sa H A/s(x)). 

A set Sa Q is said to meet the interior point condition at point x G Sa if 
the digital set (JVs(x) n 5) \ {x} is 4-connected and contains at least two points. 
Now the following theorem is true: 
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Theorem 15. Let Sa ^ be a digital set and let x be a boundary point in Sa 
whieh is loeally exposed. Assume further that the interior point eondition does 
not hold at x. 

Then there exists a digital set S'^ with J\fs{x) n Sa = C such that 

each boundary point of S'^ is a locally exposed point and S'^ is not convex. 

The meaning of this theorem is that without the interior point condition 
nothing can be said about convexity of a set only by looking at neighborhoods. 
The theorem can be proved by a rather lengthy discussion of all possible cases. 
We treat only one of these cases. 

Since x was assumed to be a boundary point, one of the direct neighbors of x, 
say Nq{x), must not belong to Sa- We consider the case that also one of the 
indirect neighbors, say iVi(x), is not in Sa- Furthermore we assume that Nq{x) 
and N-^{x) are in Sa- By convexity of Sa is N 5 {x) ^ Sa- Therefore we have the 
following situation (• denotes a point in Sa, ° a point not in Sa and • denotes 
a point whose status with respect to Sa is left open): 



o 

X • 

o o • 



If the interior point condition does not hold, one of the remaining points in As (a:) 
belongs to Sa- If e.g. N2{x) were in Sa, then we choose S'^ as follows (all points 
which are not marked • are assumed not to belong to S'^): 




Obviously all points of S'^ are locally exposed points and S'^ is not convex. 

The situation becomes less pessimistic if one considers digital sets fulfilling 
the interior point condition. Each of the eight neighbors of a point either belongs 
to Sa or not, hence there are 2® = 256 different possible neighborhood configu- 
rations. These configurations can be obtained from 51 generating configurations 
by rotations and reflections. If one is interested in configurations fulfilling the 
interior point condition, there remain 8 generating configurations (these are ex- 
actly the “simple strict boundary point-configurations” of [5, p. 160]). 7 of these 
configurations are digitally convex and 6 of them are configurations of locally 
exposed points. These latter are given in Figure 5. 

There remains one configuration which fulfills the interior point condition 
and is convex but not locally exposed: 
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Fig. 5. Configurations of exposed points. • denotes points in Sa and o denotes 
points not in Sa- The number in the center of each configuration is the config- 
uration number ^(A/s) = where cj = 1 for points in Sa and Cj = 0 

for points not in Sa- It is assumed that the center point belongs to Sa 



• • • 

• 63 • 

• o o 



We formulate an encouraging result sharpening the assertion of theorem 13: 

Theorem 16. Given a simply connected digital set Sa such that all boundary 
points of Sa fulfill the interior point condition and are locally exposed. 

Then Sa is convex. 

Proof. Assume that Sa is simply connected and that all of its boundary points 
fulfill the interior point condition. Then the polygonal set P{Sa) in asso- 
ciated canonically to Sa as above is “regular”, i.e. the (topological) closure of 
its interior is the set itself (This can be seen easily by inspection of the neigh- 
borhood configurations in Figure 5). So, P{Sa) fulfills all conditions of Tietze’s 
theorem, which completes the proof. 

The relevance of the latter theorem is rather limited since digitally convex 
sets rarely have only locally exposed boundary points, which means that con- 
figuration 63 above (as well as its rotated or reflected versions) occurs quite 
often in realistic digital sets. If this happens, neighborhood information alone 
is not sufficient for testing convexity. However, it is possible to monitor the oc- 
curence of critical configurations and to remedy the situation. One possibility, 
of course, for treating this situation is to segment the boundary into digital line 
segments in the manner of Hiibler, Klette and Voss [11] or of Debled-Rennesson 
and Reveilles [4]. These approaches, however, are not local in the sense of Tietze’s 
theorem. 
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The problems encountered in connection with application of Tietze’s princi- 
ple are sketched here by means of a special application. For retrieving shapes 
from a pictoral database it is necessary to simplify the contours of the objects 
under consideration. This can be done by removing “irrelevant” points from the 
contour [16]. Such a procedure has the advantage that the original data structure 
(a linearly ordered list of points in 1?') is not changed. Moreover, this approach 
also has a favourable stability behaviour [15]. A first step in the method could 
be to replace the polygonal set P{Sa) obtained canonically from the chain code 
by a set Q{Sa) having the properties: 

1. Q{Sa) is a polygonal set obtained from P{Sa) by deleting vertices, 

2 . SA = Q{SA)nz\ 

3. Q{Sa) has the same “convexity behaviour” of the boundary as Sa- 

It can be shown that such a set Q{Sa) cannot be found in a canonic way from Sa- 
Of course, there is some freedom of choice in interpreting the third requirement 
which is not mathematically precise. 

In Figure 6 a specific digital set is shown. The set was reduced by deleting 
points from the boundary as long as this is possible in a unique way under the 
first two conditions indicated above. 

Further reduction under the conditions stated above is no longer possible in 
a well-defined way. The problem is that a part of the boundary may have the 
following form (notations as in Figure 6, left image): 





Fig. 6. Construction of an enclosing polyhedral set. In the left picture the 
boundary of a digital set is shown. Points having neighborhood configuration 
63 (up to rotation and reflection) are indicated by o. In the right picture the 
enclosing polyhedral set after detection of digital lines is given. The original 
boundary contains 297 points, the reduced polygonal set has 75 vertices 
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It is assumed here that the interior of the set is below the curve. It is possible to 
delete one of the critical points o, but this is not an invariant action. There are 
other pragmatic possibilities to cope with this situation, e.g. one can introduce 
additional points not in ’I? thus relaxing the first two conditions above. Another 
pragmatic approach which is based on a “relevance measure” associated to the 
vertices of the polygon is given in [16]. 

11 Conclusions 

The concepts of digital lines and digital convexity were introduced thirty years 
ago in a pragmatic way by means of the grid-intersection discretization. It turned 
out that these concepts share a number of properties with lines and convex sets 
in continuous spaces. It could be shown [10] that these pragmatic concepts can 
be embedded into a formal axiomatic theory. Moreover, different definitions of 
digital convexity coincide if in addition 8-connectedness is required. Thus, at 
present digital lines and digital convexity are well understood theoretically and 
the different approaches to the subject can be unified within the framework of 
a nice theory. 

Due to the practical relevance of the subject, a number of algorithmic meth- 
ods were developed to verify convexity of digital sets. They can be classified as 
syntactic [11], arithmetic [4] and number-theoretic characterizations [30]. Typ- 
ically, these methods allow to verify convexity in a time proportional to the 
length of the boundary of the set under investigation. 

There do exist differences between digital and continuous convexity. In con- 
trast to continuous sets, it is not possible to verify digital convexity by local 
inspections only. 
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Abstract. In this paper, we first define the curvature indices of vertices 
of discrete objects. Second, using these indices, we define the principal 
normal vectors of discrete curves and surfaces. Third, we define digital 
curvature flow as a digital version of curvature flow in discrete space. 
Finally, these definitions of curvatures in a discrete space derives discrete 
snakes as a discrete variational problem since the minimization criterion 
of the snakes is defined using the curvatures of points on the discrete 
boundary. 



1 Introduction 

The decomposition of the three-dimensional neighborhood to a collection of two- 
dimensional neighborhoods reduces the combinatorial properties of a discrete ob- 
ject to the collections of combinatorial properties of planar patterns [1,2]. Using 
this geometric property of neighborhood, we define the vertex angle and curva- 
ture of a discrete surface as an extension of the chain code of digital curves. These 
angles on the boundary provide a relation between the point configurations on 
the boundary and the Euler characteristic of a discrete object. In reference [5], 
Toriwaki and coworkers applied this idea for the definition of the Euler charac- 
teristic of a discrete object. Furthermore, they derived a Boolean function for 
the computation of the Euler characteristic [6]. Their method requires all points 
in the region of interest since they defined the topological properties of discrete 
object using all points in the region of interest. Our method only requires the 
point configurations on the boundary. We second introduce a new transform for 
the binary digital set, which we call “digital curvature flow.” Digital curvature 
flow is a digital version of curvature flow [4]. We have already proposed a dis- 
crete version of curvature flow which describes the geometry of curvature flow 
for polyhedrons and polytopes [3]. Digital curvature flow describes the geomet- 
ric flow which is controlled by the curvature of the boundary of binary digital 
images in a discrete space. 

Distance transform and skeletization based on distance transform are the 
classical transformations for the derivation of global features of binary digital 
images by constructing the medial axes of images. On the other hand, the AND- 
pyramid extracts features by shrinking the original images. The transformation 
provides the hierarchical expression of global and local features of images. There 
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are continuous versions for both medial axes extraction and shrinking for fea- 
ture extraction. The former and latter are the medial axes equation and the 
scale-space analysis. These transforms are expressed as evolution equations with 
respect to scale parameters. They permit the multiscale expression of features. 
There is one more diffusion-like transformation for shapes, namely, curvature 
flow, which also permits multiscale description of geometrical and topological 
features of shapes. In this paper, we deal with a complete digital version of this 
third type of transform. Using the Euler characteristic of discrete point set, we 
can detect collapses of topology and control the topology of shapes and objects 
during the deformation by flow-based processing. 

The method of snakes extracts the boundaries of the regions by deforming 
the boundaries dynamically. For practical computation of the boundary using 
snakes [11], we must solve a variational problem using an appropriate numeri- 
cal method [12]. In digital image analysis, image data are expressed in digital 
space [13]. Therefore, in this paper, we propose a digital version of this vari- 
ational problem for boundary detection using the point configurations on the 
boundary. Furthermore, we prove that the digital boundary detected based on 
mathematical morphology [14] is derived as the solution of this digital variational 
problem. 

2 Connectivity and Neighborhood 

Setting and to be two- and three-dimensional Euclidean spaces, we ex- 
press vectors in R^ and R^ as a: = (x,y)^ and x = (x,y,z)^, respectively, 
where T is the transpose of the vector. Setting Z to be the set of all integers, 
the two- and three-dimensional discrete spaces 7? and are sets of points such 
that both X and y are integers and all x, y, and z are integers, respectively. For 
n = 2, 3 we define the dual set for the set lattice points Z" as 

W={x+^e\xGZ^}, (1) 

where e = (1, 1)^ and e = (1, 1, 1)^ for n = 2 and n = 3, respectively. We call 
Z" and Z” the lattice and the dual lattice, respectively. 

On Z^ and in Z^, 

N'*((m, n)^) = {(mil, n)^, (m, n i 1)^} (2) 

and 

N®((fc, m, n)^) = |(fc i 1, m, n)^, (fc, m i 1, n)^, (fc, m, n i 1)^} (3) 

are the planar 4-neighborhood of point (m,n)^ and the spatial 6-neighborhood 
of point {k,m,n)^ , respectively. We assume the 4-connectivity on Z^ and the 
6-connectivity in Z^. 

Setting one of x, y, and 2 : to be a fixed integer, we obtain two dimensional 
sets of lattice points Z^((fc, m, n)^), Z 2 ((fc, m, n)^), and Z|((fc, m, n)^) which 
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pass through point and are perpendicular to vectors, ei = (1,0,0)^, 

62 = (0, 1,0)^, and 63 = (0,0, 1)^, respectively. These two-dimensional discrete 
spaces are mutually orthogonal. Denoting N^((fc, m,n)^) to be the 
4-neighborhood of point (fc, m,n)^ on plane Zf((fc,m,n)^) for i = 1,2,3, we 
have the relationship 

N®((fc, m, n)^) = N^((fc, m, n)^) U N2((fc, m, n)^) U N3((fc, m, n)^). (4) 

Equation (4) implies that the 6-neighborhood is decomposed into three mutually 
orthogonal 4-neighborhoods. 

A pair of points (m,n)^ and x G N"^((m,n)^) is a unit line segment in T? . 
A pair of points {k,m,n)^ and x G N®((fc,m,n)^) is a unit line segment in 
Z^. Four 4-connected points which form a circle define a 4-connected discrete 
2-simplex [8]. Four 6-connected points which form a circle define a unit plane 
segment in Z^ with respect to the 6-connectivity. Six 6-connected points in a 
2x2x2 cube form a 6-connected discrete 3-simplex in a discrete space [8]. 
Since an object is a complex, n-simplexes of an object share (n — l)-simplexes. 
However, some (n — l)-simpleces are not shared by a pair of simplexes. 

We assume that our object on a plane is a complex of 2 x 2 sqare which share 
at least one edges with each other and that our object in a space is a complex of 
2x2x2 cubes which share at least one face with each other [2]. The boundary 
of an object is a collection of (n — l)-simplexes which are not shared by a pair 
of n-simplexes of an object. Thus, the boundary of our object on a plane is a a 
collection of unit line segments which are parallel to lines a; = 0 and y = 0- The 
surface of an object is a collection of unit squares which are parallel to planes 
X = 0, y = 0, and z = 0. This definition agrees with that of Kovalevsky [10] for 
planar objects. 

In Figure 1, we show simplexes, objects and the boundaries of objects in Z” 
for n = 2, 3. On the top columun of Figure 1, we show a 1-simplex (a), a 2-simplex 
(b), an object (c), and the boundary of this object on a plane. Furthermore, in 
the bottom columun of the same figure, we show a 2-simplex (a), a 3-simplex 
(b), an object (c), and the boundary of this object (d) in a space. 

Since in Z" there exist fc-simplexes for k = 1, 2, • • • , n, we define the collection 
of these simplexes as a discrete quasi-object; that is, a collection of connected k- 
simplexes for 1 < fc < n in an n-dimensional discrete space [2] . If both point sets 
A and A are not quasi-objects, we call A a normal object. 



3 Curvature and Normal Vector 

3.1 Normal Vector on Discrete Plane 

Since we are concerned with a binary discrete object, we assign values of 0 and 
1 to points in the background and in objects, respectively. On Z^, three types of 
point configurations which are illustrated in Figure 2, exist in the neighborhood 
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(a) 



(b) 



(c) 



(d) 



Fig. 1. In the top columun, the 1-simplex (a), the 2-simplex (b), an object (c), 
and the boundary of this object (d) on a plane, and in the bottom columun, the 
2-simplex (a), the 3-simplex (b), an object (c), and the boundary of this object 
in a space 



of the point with symbol x on the boundary. In Figure 2, symbols • and o indicate 
points on the boundary and in the background, respectively. Setting fi S {0, 1} 
to be the value of point Xi such that a?o = (w,n)^, Xi = {m + l,n)^, x^ = 
(m, n -I- 1)^, *5 = (m — 1, n)^, and x^ = (m, n — 1)^, the curvature of point a?o 
is defined by 

r{xo) = fk+^Yl fkfk+ifk+2, (5) 

kGN kGN 

where N = {1, 3, 5, 7} and k + 8 = k [6]. 

Setting C = to be the set of points on the boundary, we assume 

that for point Xj, only two points Xj^i and Xj+i are connected, and a triplet 
Xj and Xjj^i lies in the counterclockwise direction on the boundary. Using 
a triplet of points, aJj-i, Xj and a^j+i, we compute the normal vector of each 
point. For planar point Xj = {xj^yj)^ , we express this vector as a complex 
number Zj = Xj + iyj. We define the outward normal vector rij as 



( 6 ) 

for point Xj, since from the local configurations of triplets on the boundary, 
there exist three possibilities for the combination of aj and (3j such that both 
\aj\ and |/3j| are l/-\/2, and one of \uj\ and |/3j| is zero and the other is one. 



rij — Xaj, aj — (aj,f3j) , aj + ij3j — ( 

\Zj-i - Zj 



A,= 



i if ajf3j = \ 
^ otherwise. 
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(a) (b) (c) 



Fig. 2. Directions of the normal vectors for the positive and negative points on 
the boundary 




(-, 0 , 0 ) 

Fig. 3. Angles and configurations on the boundary of three-dimensional 6- 
connected objects 



Figure 2 shows the normal vectors of three configurations on the 4-connected 
boundary shown in Figure 2. 
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The curvature indices of configurations (a), (b) and (c) , which is defined by 
r{xo) are positive, zero, and negative, respectively, when the directions of the 
normal vectors are inward, neutral, and outward, respectively. Therefore, we call 
these configurations convex, flat, and concave, respectively, and affix the indices 
+, 0, and — , respectively. 

3.2 Normal Vector in Discrete Plane 

Using combinations of planar curvature indices on three mutually orthogonal 
planes which pass through a point aip, we define the curvature index of a point 
Xq in since the 6-neighborhood is decomposed into three 4-neighborhoods. 
Setting ai to be the planar curvature index on plane Z|(a;o) for i = 1,2,3, the 
curvature index of a point in Z^ is a triplet of two-dimensional curvature indices 
(ai, Of2, as) such that ai G {-I-,— ,0,0}. Here, if = 0, the curvature index 
of a point on plane Z^{xq) is not defined. Therefore, for the boundary points, 
seven configurations, {+, +, -h), (-f , -f , -), {+, 0, 0), (0, 0, 0), (-, -, -), {+, -, -), 
(—,0,0), and their permutations are possible. 

Since a triplet of mutually orthogonal planes separates a space into eight 
parts, we call one eighth of the space an octspace. The number of octspaces 
determines the configurations of points in a 3 x 3 x 3 cube. There exist nine 
configurations in the 3x3x3 neighborhood of a point on the boundary, since 
these configurations separate Z^ into two parts which do not share any common 
points. The same configurations have also been introduced by Frangon [9] for 
the analysis of discrete planes. The curvature analysis of discrete surfaces also 
yields these configurations. 

For a spatial curvature index a, setting n{a) to be the number of octspaces 
in the 3x3x3 neighborhood of a point, the relationships n((-|-, -I-, -I-)) = 1, 
n((+,0,0)) = 2, n((+,+,-)) = 3, n((-, -,-)+) = 4, n((O,O,0)) = 4, 
n((-, = 4, n((-h,-,-)) = 5, n((-,0,0)) = 6, and n((-,-,-)) = 7 
are held. 

The curvature indices correspond to nine configurations of points which are 
illustrated in Figure 3. Since | = 1 — we define the vertex angles of 
configurations as 7((-|-, -I-, -|-)) = 1/8, 7((-|-,0,0)) = 2/8, 7((+,+,-)) = 3/8, 
7((-, -,-)+) = 4/8, 7((O,O,0)) = 0, 7 ((-, = -4/8, 7((+,-,-)) = 
— 3/8, 7(— , 0, 0) = — 2/8, and 7((— , — , — )) = —1/8. Considering congruent trans- 
formations in Z^, since a code uniquely corresponds to a configuration, the vertex 
angle for a point on a surface is an extension of the chain code for a point on a 
curve. 

Point configurations on the boundary are defined by the discrete lines which 
lie on three or two mutually orthogonal vectors. Therefore, we define three nor- 
mal vectors ni, ri2, and ns which lie on planes perpendicular to vectors ei, 62, 
and 63, respectively. According to these definitions, for n = (a,/3)^ on each 
plane, we have ni = (0,a,/3)^, U 2 = (a, 0,/3)^, and ns = (a,/3, 0)^. Using vec- 
tor na, a = 1,2,3, we define normal vector n{x) for point x on the boundary S 
in digital space Z^ . There are nine combinations of vertex configurations in the 
3x3x3 neighborhood on the boundary. 
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The directions of the normal vectors for the boundary configurations shown in 
Figure 3 are determined according to the combinations of two and three normal 
vectors on mutually orthogonal three planar discrete boundaries. Therefore, we 
express the normal vector n{x) of point x as 

n{x) = ^Na{x) = + 02 n 2 + (7) 

for matrix N = (ni,n 2 ,^^ 3 )- For codes +, — , 0, and 0, setting s(±) = ±1, 
s( 0 ) = 0 , and s( 0 ) = 0 , we define a function as 

/(«!, 02, as) = (s(ai) + 5 ( 02 ) + 5 ( 03 )) X (|s(oi)| + |s(o 2 ) + |s(o3)|)- ( 8 ) 

This function takes values of {0, ±1, ±3, ±9} according to the configurations on 
the boundary. We define the relation between a{x) = (o;i,a 2 ,Q! 3 ) and trinary 
coefficients a{x) = ( 01 , 02 , 03 )^ for point x as 

o(a:)(±,0,0) (0,±,0) (0,0, ±) (±,±,±) 

a(a;) ( 1 , 1 , 1 ) ( 1 , 1 , 1 ) ( 1 , 1 , 1 ) ( 1 , 1 , 1 ) 

a{x) (±, ±, =f) (±, T, ±) (T, ±, ±) /Q,, 

a(a:) ( 1 , 1 , - 1 ) ( 1 , - 1 , 1 ) (- 1 , 1 , 1 ) 

o(a:) ( 0 , 0 , 0 ) ( 0 , 0 , 0 ) ( 0 , 0 , 0 ) 

a(a:) ( 1 , 1 , 0 ) ( 1 , 0 , 1 ) ( 0 , 1 , 1 ). 

Then, a positive or negative of the sign of function /(ai,a 2 ,a 3 ) indicates the 
direction of normal vector n{x) at point x on the boundary to be outward or 
inward, respectively. 

4 Curvature and Topology 

4.1 Euler Characteristics of 4- and 6 -Connected Objects 

When a closed simple curve C on a plane is expressed as vector a;(s) for 0 < 
s < S, where S is the length of the curve, x = (cos 0(s), sin 0(s))^ denotes the 
tangent vectors of this curve at point x{s). We assume that vector x{s) moves on 
the boundary so that vector x^ = (— sin 0 (s), cos 0 (s))^, which is perpendicular 
to X, directs inward. The function 9{s) such that — tt < 6{s) < tt satisfies the 
Gauss-Bonnet formula 

9(s)ds = 27 t( 1 — g), (10) 

where g is the number of holes of this object. Setting n+ and n_ to be the num- 
bers of positive and negative points for point configurations on the 4-connected 
boundary on a plane, respectively, we obtain the following theorem as discrete 
version of the Gauss-Bonnet formula. 

Theorem 1. Let g be the number of holes of a planar discrete object. If a dis- 
crete object is j-connected, the relation n+ — n_ = 4(1 — g) is held. 
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Fig. 4. Configurations in a 3 x 3 region on the 8-connected boundary graphs. 
We show the code of the point with the symbol x , and points with symbols • 
and o belong to the object and the background, respectively 



Setting Ua to be the number of configurations on the boundary whose an- 
gle is a/8, for vectors n = (n_ 4 , n_ 3 , n_ 2 , n_i, no, ni, ri 2 , ns, 714 )^, and a = 
(— 2, — 1, 0, 1, 0, 1, 0, — 1, — 2)^, we obtain the following theorem as the three- 
dimensional analogous of this theorem for planar sets. 

Theorem 2. For an object with g tunnels, the object holds the relationship 
n = 8(1 — g). 

4.2 Euler Characteristic of 8-Connected Objects 

Next, we deal with 8-connected objects on Z^. A point set 

N®((m, n)^) = {(m±l,n±l)^,(m±l,n±l)^} (11) 

is the planar 8-neighborhood of point (m,n)^. Triangle with vertices (m,n)^, 
(m -|- l,n)^, and (m,n -I- 1)^, and its congruent triangles in are two- 
dimensional simplexes on 8-connected discrete plane [8]. 8-connected object is a 
complex of these simplexes. Therefore, in the 3x3 regions along the 8-connected 
boundary, there exist 10 configurations, as illustrated in Figure 4. We assign the 
curvature codes —3/2, —1/2, —1, —8, 0, 08, -1-8, -1-1, -1-1/2, -1-3/2 to these configu- 
rations and we express the numbers of these configurations on the boundary of an 
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object as p-, m_, n_, s_, no, sq, s+, n+, m+, and p+. For point configurations 
on the 8-connected boundary, we have a discrete version of the Gauss-Bonnet 
formula. 

Theorem 3. Let g he the number of holes for an 8-eonnected object. Then, 
points on the boundary satisfy the relation 

(m+ -I- 2n+ -I- 2s+ -|- 3p+) — (m_ -I- 2n_ -I- 2s_ -|- 3p_) = 8(1 — g). (12) 



5 Digital Curvature Flow 

5.1 Deformation on Discrete Plane 

Using the outward normal vector for each point on the boundary, we define a 
transform from point set C on 7? to point set C on as 

- if 7(a;j) yf 0, 

^ \xj- rij ± inj , if -/(xj) = 0. 

This transform controls the deformation of points based on the local geometrical 
properties of the discrete boundary. Therefore, we derive a transformation using 
the global information of concavity of the discrete boundary. First, using the 
curvature code ’j{xj) of each point Xj, we classify points on boundary C into 
types N+, and N_. First we define 

N_(j) = {xp\-f{xf}) = 0, j < P < j + m, s{j{xj)) X s{-f{xj+rn)) = -1}, (14) 

where 5(7(0?)) = 1 and 5(7(0?)) = -1 for (7(0?^), 7(0?^-+^)) = (+, -) and 
(j{xj),'-f{xj+jn)) = (—,+), respectively. Second, we set 

N_ =|jN„(j), N+ = C\N_. (15) 

j 



Each N(_ is a sequence of flat points whose one endpoint is the concave point. 
Furthermore, N_ is the union of these sequences on the boundary. Then, we 
have the relation 

C = N+|Jn_, N+p|N_ =0. (16) 

Points in N+ and N_ lie in convex and concave parts on the original boundary. 
The configurations of positive and negative end points and the lines determined 
by the signs of the endpoints are shown in Figure 5. 

Using these geometrical properties of point sets N+ and N_ on the boundary, 
we define a global transform from point set C on to point set C on Z^ as 

(xj-rij, if7(a?j)yf0, 

Xj = ^ Xj - rij ± if j{xj) = 0, Xj G N+, (17) 

[ Xj + rij ± , if j{xj) = 0, Xj G N_. 




238 Atsushi Imiya 



In Figure 5, we show the direction of motion for flat points during the defor- 
mation based on the curvature vectors of a discrete curve. Although this trans- 
formation transforms a point on a corner to a point, a flat point is transformed 
to a pair of points. Therefore, for a corner, this transformation acts as curvature 
flow, though for flat points this transformation acts as diffusion on the boundary 
curve. We call the motion of boundary points caused by the successive appli- 
cation of this transformation digital curvature flow. Therefore, odd and even 
steps of digital curvature flow transform points on the lattice to points on the 
dual lattice and points on the dual lattice to points on the lattice, respectively. 
Pyramid transform is also defined using the lattice and the dual lattice. The 
first and second steps of pyramid transformation transform functions defined on 
the lattice to functions defined on the dual lattice, and functions defined on the 
dual lattice to functions defined on the lattice, respectively. More generally, the 
odd and even steps of pyramid transformation transform functions defined on 
the lattice to functions defined on the dual lattice, and functions defined on the 
dual lattice to functions defined on the lattice, respectively. 

From vectors ei = (1,0)^ and 62 = (0,1)^, we define vectors do = ^eo, 
di = 562 , do = —do, and dj = —di. Furthermore, setting a = 1 — a and 
j3 = 1 — /3, we define vector da /3 = d^ + dp. Vector dap satisfies the relation 
dap = —dap. According to the definition of normal vector rij at each point, 
rij is an element of vector set D = Dq U D±, where Dq = {do, di, dg, dj}, 
and D± = {dgo, dio, dn, dgi}. Later in this section, we express the sequence of 
curvature codes as T = (7172 • • -^n) for 7 ^ = 'ji+N- Setting the transformation 
from curvatures to normal vectors as 



£» : T= (did2 ---dAr), (18) 

we have the relations in the 3x3 neighborhood as 

D : (...Q-HO---) = (•••d---), de D±, 

I? : (•••O-O---) = ( d---), deD±, (19) 

D : (•••OOO---) = (•••d---), de Dg. 

As seen in Figures 5 and 6, the transformation defined by eq. (17) does not 
locally change the numbers n+ and n_, where n+ and n_ are the numbers 
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Fig. 5. Motions of the positive and negative lines on the boundary 
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of convex and concave points. For three configurations (a), (b) and (c), shown 
in Figure 5, the transformation from a lattice to a dual lattice transforms the 
sequences of curvature codes as 

F: (•••0 + 0__^+0---) = (•••0 + 0_^+0---), 

N N-1 

F: (•••0 + 0__^-0---) = (•••0+ (20) 

N N-l+S 

N N-1 

for d S {0, 1}. The first and third configurations shorten the curve one units. 
However, the second configuration preserves the local length of the boundary 
curve, or shortens one units. Therefore, considering the relation n+ — n_ = 4, in 
each step, four positive points move inward, this motion shortens the curve by 
4x1 = 4 units. 

Third, we show configurations in regions of intermediate size. For the config- 
uration (• • • 00 — 0 -I- 0 -I- 0 — 00 • • •), we have the relations 

: (• • • 00 - 0 -h 0 -f 0 - 00 • • •) = (• • • 00 • • •), (21) 

where F^ for k > 1 expresses that operation F is applied k times. 



In Figure 7 (a), we show configurations of points derived by the deformation 
based on eq (17). In the figure, the configuration on top is the original configu- 
ration, and the second one is the configuration for the next step. Furthermore, 
successive application of the transformation yields the configuration shown at 
the bottom. This example suggests that the boundary curve on a digital plane 
is smoothed by digital curvature flow. 

Finally, we examine asymptotic properties of boundary-curve evolution un- 
der digital curvature flow. In Figure 7 (b), we show an example of the deforma- 
tion process for a digital curve. If we consider discrete quasi-objects in digital 
space [2] , a line segment shrinks to a point under digital curvature flow. From the 
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Fig. 6. Local motion of the positive and negative lines on the boundary 
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(a) (b) 

Fig. 7. Examples of a local deformation process (a) and a global deformation 
process (b) 



definition of the transformation, points in N+ and N_ move inward and out- 
ward, respectively. Furthermore, convex and concave points also move inward 
and outward, respectively. Moreover, the relation n+ — n_ = 4 for corner points 
is preserved. Therefore, n_ finally becomes 0, since all line segments which con- 
nect convex points and concave points are eliminated from the boundary. If we 
apply curvature flow to a rectangle, we obtain a line segment. These considera- 
tions lead to the following theorems. 

Theorem 4. For planar curves, in each step, digital curvature flow shortens 
the boundary curve by 4 unit lengths if there is no hole. 

Theorem 5. Digital curvature flow preserves topology if there is no hole. 

Theorem 6. Digital curvature flow transforms a boundary curve into a rectan- 
gle. 

Theorem 7. The final form produced by digital curvature flow is a line segment. 

We also examine geometric properties of digital curvature flow for planar 
objects with holes. For the computation of outward normal vectors for the inner 
boundary curves, we should follow the curves in the clockwise direction. The 
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directions of outward normal vectors are the same as the directions of the inward 
normal vectors of curves if we assume that these curves are the outer boundary 
curves of appropriate regions. The motion of sequences of flat points on the inner 
boundaries are the same as the motion of flat points on the outer boundary curve. 
Therefore, each inner boundary converges to a line segment. This property leads 
to the following theorem. 

Theorem 8. Inner boundaries of a planar digital object converge to line seg- 
ments. 

The boundary curve encircles a collection of inner boundary curves. For a pair 
of curves, the following property is held. 

Property 1 Assuming that one curve encircles the other curve, the inner curve 
converges faster to a line segment. 

This property implies the next theorem since the convergence speed of small 
inner boundaries is faster than the convergence speed of outer boundaries. 

Theorem 9. If sizes of holes are small, the number of line segments in the 
rectangle is equivalent to the number of holes of the original object. 



5.2 Deformation in Discrete Space 

As mentioned in the previous section, we can define the signs of vertices and the 
points on edges. Using the codes for vertices and edges, we deflne codes for flat 
points on the boundary surface. 

If an end of a line segment is negative, we say that this line segment is a 
negative line segment. Furthermore, if both ends are positive, we say that this 
line segment is positive line segment. For boundary points whose two-dimensional 
codes are zero, we affix curvature codes with the same codes with the codes of 
line segments on which points lie. 

This rule derives the code (a,/3i,/32), (/3i,a,/32), and (/3i,/32,a), where 
Pi G {+,—}, for a point whose codes are (a, 0,0), (0,0, 0), and (0,0, a), where 
a G {-I-,— ,0}. These notations lead to codes (-I-, -I-, -I-), (-1-,— ,— ), 

(0,-1-, -I-), (0,-1-,—), (0,—,—), and their permutations. Using these 
codes, we define the code of the flat points and points on edges as 



g{a,Px,P2) 



+, if f{a,Pi,P2) = 9, 
— , otherwise. 



(22) 



for (a,Pi,P 2 ) and its permutations. 

Setting 

h{Pi,P 2 ) = (siPi) + s{P 2 )) X (|s(/3i)| + \s{P 2 )\), (23) 

we set the codes of flat points and points on edges as positive and negative, if 
h{Pi,P 2 ) > 0, and otherwise, respectively. For these curvature codes, we define 
that the code of a flat point is negative if the number of the negative codes in 
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the curvature code is positive. Furthermore, we move flat points outward and 
inward, if the codes are positive and negative, respectively. 

Two line segments, which are mutually orthogonal, pass through on the flat 
points with codes (0,0,0), (0,0,0), and (0,0,0). Therefore, using the codes of 
end points of these two line segments on a plane, we can affix the codes of flat 
points. If at least one end point is negative, we assgin negative to a point on 
this line. There are six possibilities for the configurations of end points of two 
mutually orthogonal line segments, and the codes of common point of these two 
line segments are shown in Figure 8. These definitions for codes also conclude 
that the codes of points with curvature codes (+,0,0), (0,+,0), and (0,0,+), 
on an edge are negative if one end or both ends are negative. In Figure 9, we 
show examples of the curvature codes (+,0,0) on edges and a flat point which 
are derived using the six configurations in Figure 8. 




Fig. 8. The configurations of end points of mutually orthogonal line segments 
and the signs of flat points 



The normal vectors on the discrete surface defined in section 3.2 are in the 
form ^ 

n{x) = -{aiBi + a 2 B 2 + Uses) , (24) 

where Ui G { — 1,0, 1}. Therefore, if Oi = 1 and Ui = —1, the vector (x + n{x)) 
determines the transformation from a: G to y G Z^. Moreover, if and only 
if the codes are (+,+,+), (—,—,—), and (— ,— ,— )_and for a ,/?, 7 G 
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N- 



N- 



N- 



(c) 



Fig. 9. Examples of the signs of flat points and edge points 



{+,—}, the vector (x + n{x)) determines the transformation from x G to 

yGl?. 

For the code , (a, 0,0), (a, 0,0), and (0,0, a) for a G {+,— ,0}, 

the normal vectors hold the relations 



a(a:) (0,0,0) (0,0,0) (0,0,0) 

nix') 0 0 0 

a(a:)(±,0,0) (0,±,0) (0,0,±) 

n{x) ±5(0262 ± 0363) ±5(0161 ± 0363) ±5(0161 ± 0262). 



(XiG-i -\- CLjGj 



(25) 



However, these normal vectors do not define the transformation from points in 
to Z^, Therefore, setting 



a{x) (0,0,0) 
m{x) ±561 
a{x) (±,0,0) 



( 0 , 0 , 0 ) ( 0 , 0 , 0 ) 

n{x) ± 562 n{x) ± 563 

( 0 ,±, 0 ) ( 0 , 0 ,±) 



m{x) ±5(0262 ± 0363) ±5(0161 ± 0363) ±5(0161 ± 0262), 



we define the transformation from a; G Z^ to y G Z^ as 



^n(x) ± iefc 



(26) 



y + x + em{x), (27) 

where e is 1 for codes (±,±,±), (-,-,-), (-,-,-)_, (±,±,-), (±,-,±), and 
(— ,±,±) and e is —1 for points where g{a,f 3 i,P 2 ) is negative. Therefore, the 
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negative sign of e is depends on the configuration of two mutually orthogonal 
line segments which pass through the points. Since the saddle points on a dis- 
crete dambbell move inward, our method preserves the topology of the dambbell. 
This is the significant difference from the curvature-flow-based deformation in 
the three-dimensional Euclidean space R^. However, according to the defini- 
tions of the directions of the motion of points in curvature-based flow, positive 
points on edges with both end points negative move outward. These configura- 
tion defines the outword motion for points on the bar of a dumbbel. Therefore, 
these definitions of the codes of flat points and edges preserve the topology of 
dumbbells. 

Curvature flow transforms each planar curve segment which passes through 
lattice points on a polyhedral boundary into a straight line segment which passes 
through lattice points on a polyhedral boundary. Furthermore, each closed curve 
on a plane is transformed to a rectangle. If a(x) = (-I-, -I-, -I-), on each step, 
curvature flow eliminates a 3/4 unit area from the corners. Moreover, positive 
and negative parts on the boundary moves outward and inward, respectively. 
These considerations lead to the following theorems assuming that an object 
does not contain any tunnels. 

Theorem 10. For a closed surface, in each step, digital curvature flow shrinks 
the boundary curves on each plane six unit areas from corners such that ^{x) = 
(+>+,+)• 

Theorem 11. Digital curvature flow preserves topology if there is no tunnel. 

Theorem 12. Digital curvature flow transforms a boundary into a rectangle, if 
there is no tunnel. 

Theorem 13. The final form of digital curvature flow is a spatial rectangle if 
there is no tunnel. 

In Figures 10 and 11, we show examples of discrete deformation. These exam- 
ples show that this deformation preserves the the topology of a discrete dambell. 

6 Discrete Variational Method 

Setting f{x) to be a gray- valued image in a three-dimensional space, a three- 
dimensional version of snakes detects a boundary surface S by minimizing the 
criterion, 

J = [ {a + (3\K\)ds - 7 / F{\Wf{x)\x^s)ds (28) 

Js Js 

for positive constants a, (3, and 7 , where K and ds are Gauss curvature on 
the surface S, and the infinitesimal area on the surface S, respectively. It is 
possible to select positive function F{-) such that if S passes through locally 
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Fig. 10. An example of deformation. The discrete dambbell at the top is de- 
formed into a parallelepiped at the bottom 



steepest points, the second term becomes large. We set F{x) = x. For gray- 
valued images, all points on the boundary should have the same gray value. 
Therefore, we adopt the minimization criterion 

E= J + S [ \Vsf{x)\ds, (29) 

Js 

where 5 is a positive constant and Vg is the gradient on surface S. For quantized 
digital images, the second term of eq. (29) becomes the minimum value zero if 
S lies in one of two regions B and W = B for binary images, where B is the 
region with value 1, namely, the black region. We call W the white region. 

If we assume 6-connectivity, the first and second terms are an and /3fc, re- 
spectively, where n and k are the total numbers of points on the boundary and of 
nonflat points, respectively. For fijk = k) for integers i, j and fc, we define 

the discrete gradient as f)ijk = {uijk,Vijk,WijkV , for Uijk = fi+ijk~ fi-ijk, 
Vijk = fi j+i k- fi j-ik, and Wijk = fi j k+i ~ fi j-i k-i- These equations lead to 
the relationship, for binary images, 



— Cl -|- C2 “h C3, 



(30) 
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Fig. 11. An example of deformation. The object in the left is tansformed into 
the cube on the right 



where, for a = 1, 2, 3, 



f 1 , if 7a(p) > 0 
^ 0, otherwise, 



(31) 



where ”fa{p) is the two-dimensional curvature code of point p on the plane 
perpendicular to vector Bq. Setting mi and mo to be the numbers of black 
points and white points on the boundary surface of binary images, we obtain a 
criterion 

r 0, if S G B, 

g{S) = <^ 0, if S G W, (32) 

[ I mi — mol, otherwise. 

These considerations lead to the relation 



Ed{v) = an + Pk — jf + Sg, (33) 

where / is the total sum of -I- \vijk\“^ + on the boundary surface. 

Theorem 14. Ed takes the minimum value if S is a 6-connected digital sur- 
face. 

(Proof) Let the numbers of points and vertexes in A be n' -I- 1 and k', respec- 
tively. Assuming that the boundary surface is 6-connected surface, the local 
deformation of this surface is described as S' = S \ {p}lJA, where a point p 
and a point set A are a point on S and a subset of B or W, respectively. There 
are two possibilities for the selection of a point p, since there are flat points 
and nonflat points on the boundary. Furthermore, for nonflat points, there are 
8 configurations listed in Figure 3. There are two possibilities for the selection 
of a set A which is replaced to a point p. Therefore, setting 



AEd = Ed{S')-Ed{S), 



(34) 



the combinations of {p} and A, based on nine configurations of a point on the 
boundary shown in Figure 3 and two possibilities for the selection of a set A, 
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lead to the relation 





AEd = an' + j 3 {k' + a — e) — + 6 { 9 \n — n' — 2|) 


(35) 


for 




r 0, if 7(p) = (0,0,0), (0,0,0), (0,0,0), 
[ 1, otherwise. 


(36) 




a=] 


f 6, if i is even for vertex angle i/ 8 , 
[ 8, otherwise. 


(37) 






r 0, if 7(P) = (0,0,0), (0,0,0), (0,0,0), 

[ m(p), otherwise. 


(38) 


where m{p) is 


the difference between the total of the gradients of the boundaries 


S' and S, and 




r0,if AcB, 
} 1, otherwise. 


(39) 



Converting a point set {p} to a set of points A, the total sum of gradient on the 
surface decreases, since the gradient is zero on each point in A. Therefore, m{p) 
is always negative. These relations lead to the conclusion that AEjj is always 
positive. Therefore, a locally optimal boundary surface is 6-connected. (Q.E.D.) 

The 6-connected surface is computed by S = B \ (B © N^®), where A 0 B 
is the Minkowski subtraction [14] of point set B from point set A and is 
the 26-neighborhood of the origin which is defined as N^®((fc, m, n)^) = {(fe ± 
ei,m± £2,n± £3)^} for £i,e2,£3 G {0,1}. _ 

Setting h, k, /, and g to be the parameters for S, we have the equalities 
and inequalities fi > n, k = k, f < f, and g = g = 0 , if both S and S are not 
quasi-objects. Here, both S and S are not quasi-objects, if both B and W = B 
are not quasi-objects. Therefore, we have the following theorem. 

Theorem 15. //B is a normal object, we have the inequality Eu{S) < Ez){S). 

Theorem 15 implies that a 6-connected boundary surface is the optimal solution 
for the minimization criterion E^ if we select an initial surface in the neighbor- 
hood of B in W. 

For a surface in the interior of B, the third and fourth terms are always 
zero. It is possible to select a surface such that n+i = 8 and n-a = 0 for o; = 
1,2,3, 4, which is a parallelepiped. This surface Si satisfies the relation if_D(S) > 
Ei){Si). Furthermore, in the exterior of W, we can also select a surface Ve such 
that n+i = 8 and n_Q = 0 for cr = 1, 2, 3, 4, which is a parallelepiped, satisfying 
the relation Ed{S) > Ejj{Se)- Although the existence of these surfaces depends 
on function F and the selection of parameters a, /3, 7, and <5, these properties 
lead to the following theorem. 

Theorem 16. The minimization criterion Ed for binary images is not globally 



convex. 
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7 Conclusions 

In this paper, we first defined the curvature indices of vertices of discrete ob- 
jects [3]. Second, using these indices, we defined the principal normal vectors 
of discrete curves and surfaces. Third, we defined digital curvature flow as a 
digital version of curvature flow in a discrete space. Our treatment of curvature 
flow is also considered as a numerical method for partial differential equation 
of curvature flow in a grid space. Finally, these definitions of curvatures in a 
discrete space derived discrete snakes as a discrete variational problem since the 
minimization criterion of the snakes is defined using the curvatures of points on 
the discrete boundary. Furthermore, we proved that the digital boundary curve 
detected using mathematical morphology is derived as the solution of this digital 
variational problem. 

Images and objects in computers are expressed in digital form. Therefore, in 
computer vision, we first describe problems for the variational forms using con- 
tinuous functions, and second discretize the problems for the numerical compu- 
tation. In this paper, we directly converted a variational problem of a continuous 
functional into an optimization problem of a discrete functional which is defined 
in the lattice space. 
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Abstract. Our theory of Hausdorff discretization has been given in the 
following framework [10]. Assume an arbitrary metric space {E, d) {E can 
be a Euclidean space) and a nonvoid proper subspace D oi E (the discrete 
space) such that: (1) D is boundedly finite, that is every bounded subset 
of D is finite, and (2) the distance from points oi E to D is bounded; 
we call this bound the covering radius, it is a measure of the resolution 
of D. For every nonvoid compact subset K oi E, any nonvoid finite sub- 
set S oi D such that the Hausdorff distance between S and K is minimal 
is called a Hausdorff discretizing set (or Hausdorff discretization) of K\ 
among such sets there is always a greatest one (w.r.t. inclusion), which 
we call the maximal Hausdorff discretization of K. The distance between 
a compact set and its Hausdorff discretizing sets is bounded by the cover- 
ing radius, so that these discretizations converge to the original compact 
set (for the Hausdorff metric) when the resolution of D tends to zero. 
Here we generalize this theory in two ways. First, we relax condition 
(1) on D: we assume simply that D is boundedly compact, that is ev- 
ery closed bounded subset of D is compact. Second, the set K to be 
discretized needs not be compact, but boundedly compact, or more gen- 
erally closed (cfr. [15] in the particular case where E = R” and D = 7H). 

Keywords: Distance, metric space, compact set, proximinal set, bound- 
edly compact set, discretization, dilation, Hausdorff metric. 



1 Introduction 

In [10] we indroduced a new approach to discretization, based on the Hausdorff 
metric: given a nonvoid compact subset K of the “Euclidean” metric space E, a 
possible discretization of K is a finite subset S of the “discrete” nonvoid proper 
subspace D of E, whose Hausdorff distance to K is minimal. Since there can be 
several such sets S, we called any such S a Hausdorff discretizing set (or Haus- 
dorff discretization) of K , and used the term maximal Hausdorff discretization 
of K for the greatest among those Hausdorff discretizing sets. (Note that the 
latter always exists, because the family of Hausdorff discretizing sets is nonvoid, 
finite, and closed under union). 
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For example, take E = R", = Z". We associate to each p £ D the closed 

cell C{p) consisting of all Euclidean points x € E which are at least as close to p 
as to any other point in D: 

ypGD, C{p) = {x € E \yq G D, d 2 {x,p) < d 2 {x,q)} , (1) 

where c ?2 denotes the Euclidean distance. The supercover discretization Asc 
associates to every X C E the set of all p G D such that C (p) intersects X (see 
Fig. 1): 

VX C E, Asc{X) = {pGD\ C{p) n X ^ 0} . (2) 

Then given a metric on E induced by a norm, for every nonvoid compact sub- 
set K of E, Asc{K) is a Hausdorff discretizing set of K [11]. 

Such a discretization follows a global approach, in the sense that whether a 
point p G D belongs to a Hausdorff discretizing set of K or not, may depend 
on points of K which can be at an arbitrary distance from p (see Fig. 2). We 
showed in fact that the maximal Hausdorff discretization of K consists of all 
points p G D such that the closed ball of radius r//(X) centered 

about p, intersects K; here that radius r//(X), called the Hausdorff radius of K, 
depends globally on K (and on D): it is the maximal distance from a point of K 
to the closest point of D. It is precisely the dependence of rn{K) on K that 
makes Hausdorff discretization global. 

However, as explained in [10,11], this does not mean that our theory is incom- 
patible with the usual practice of discretizing objects by means of local opera- 
tions. We have already given above the example of the supercover discretization, 
which is purely local, but nevertheless gives always a Hausdorff discretizing set 
when the distance is based on a norm. In [11], we have also shown that in any 
case, all Hausdorff discretizing sets lie in a precisely defined neighbourhood of 
the supercover discretization. 

The main interest of our approach is that the Hausdorff distance between 
a nonvoid compact set K and its Hausdorff discretizing sets is equal to the 
Hausdorff radius rniK). Now a measure of the resolution (or grid spacing) of the 
“discrete” space D is what we call the covering radius, namely the supremum r^ 
of the distances of points of E to D. We assume that r^ < oo, and we have 
always rniK) < rc- A consequence of this is that when the resolution D tends 





Fig. 1. a) The structuring element A is C(o). b) A Euclidean set X overlayed 
with the discrete points p and their cells C{p). c) The supercover discretization 
Asc{X) 
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Fig. 2. Left: A compact set K = A U i? U C overlayed with discrete points 
p,q,r,s and their cells C{p),C{q),C{r),C{s). Right: For d = d 2 (the Euclidean 
distance), the maximal Hausdorff discretizations of A, B and C are {p}, {g}, and 
{s} respectively, while the maximal Hausdorff discretization of K is {p, g,r, s}. 
We show the circles of radius rniK) centered about these points. The additional 
point r arises because its Hausdorff distance to B is less than the Hausdorff 
distance of p to A. The unique other Hausdorff discretizing set of K is {p, g, s} 



to zero, so does rniK), and the Hausdorff discretization tends to the compact 
set in the Hausdorff metric sense. 

Many theories consider the discretization of closed subsets of the Euclidean 
space R" into subsets of the discrete space Z”. This is the case of the morpholog- 
ical theory of discretization by dilation [8]. In [15] we also presented the theory 
of Hausdorff discretization of closed sets in the case of E = R" and D — 7A . 

As we will see, the main results of our abstract framework [10] can be 
extended to closed sets, in particular the fact that the Hausdorff distance 
between K and its Hausdorff discretizing sets is equal to its Hausdorff ra- 
dius rniK), and so is bounded by the resolution of D. Our characterization of 
Hausdorff discretization takes exactly the same form as for compact sets in [10], 
for a class of closed sets called proximinal sets, that is those where for every point 
outside the set, some point in the set minimizes the distance to it. Note that if 
the space E is boundedly compact, that is every bounded closed set is compact, 
then every closed set is proximinal. For example R" (n € N) is boundedly com- 
pact. In fact, a normed vector space is boundedly compact iff it is isomorphic to 
R" for some n G N. 

Also we extend our theoretical framework by relaxing one of the two axioms 
for D postulated in [10]. There we assumed that every bounded subset of D is 
finite. Now we make the weaker assumption that every bounded closed subset 
of D is compact, in other words D is boundedly compact. In some way, the 
space D is not necessarily discrete, so instead of a discretization we may speak 
of a sampling. 

What is the interest of this generalization, namely of taking closed sets in- 
stead of compact ones, and of sampling in a space D that is not necessary dis- 
crete? First, taking into account unbounded objects allows one to give models 
for the discretization of standard unbounded shapes like straight lines, parabo- 
las, etc. Second, by assuming the space D to be boundedly compact instead of 
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discrete (i.e., boundedly finite), we can give a complete theory of sampling on 
any kind of grid with variable resolution, especially if that grid has accumulation 
points. For example, in the log-polar model for circular images, where pixels are 
positioned by sampling the angle and the logarithm of the radius, the origin is 
an accumulation point of such a grid. Although the accumulation point is ex- 
cluded from the digital model, it has to be taken into account in any theory 
that deals with all possibles resolutions (in the same way as the theory of real 
numbers is necessary to understand digital numbers with arbitrary precision). 
Another use of our theory is when one discretizes an object only in some of its 
coordinates, so one associates to a closed subset F oi E = R“+^ a closed sub- 
set S oi D = R“ X such that Hd{S, F) is minimal; again D will be boundedly 
compact, but not boundedly finite. A related problem is the sampling without 
quantization of numerical functions, which transforms a function R" — > R into 
a function Z" ^ R; a geometrical approach to such a sampling will consider the 
graph (or hypograph) of the original function as a subset oi E = R"+^, and will 
derive from it the graph (or hypograph) of the sampled function as a subset of 
the boundedly compact subspace D = Z” x R. 

The approach of minimizing the Hausdorff distance has also been used by 
Sendov [13] to approximate a given function / by a function in some class S\ one 
takes the function g G S such that the Hausdorff distance between the graphs 
of / and g is the smallest possible. 



Paper Organization 

In Sect. 2 we recall certain known facts about closed and compact sets in a 
metric space, as well as related types of sets: boundedly compact, boundedly 
finite, and proximinal sets. We recall also the Hausdorff metric on compact sets 
and its extension to closed sets, and prove two results on compact families (in 
Hausdorff spaces) of compact sets, which are probably known, but do not appear 
in standard textbooks. Then in Sect. 3 we recall the main results of [10] and 
extend them to the case where the subspace H of if is boundedly compact 
(instead of boundedly finite). Section 4 extends further the theory by considering 
the sampling of closed sets (instead of compact ones in [10]), with particular 
results for boundedly compact sets. The Conclusion describes the place taken 
by these results in our research programme. 



2 Some Families of Closed Sets in a Metric Space 

We recall first some elementary facts from topology, especially in the case of 
metric spaces. Our survey goes deeper than in [10], and the reader is referred 
to [2,4, 5, 9] for more details. Then in Subsect. 2.1 we give the definition and main 
properties of boundedly compact, boundedly finite, and proximinal subsets of 
a metric space. The definition and basic properties of the Hausdorff metric for 
these sets are given in Subsect. 2.2. Finally in Subsect. 2.3 we give two important 
results on compact families of compact sets in a metric space. 
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We adopt a systematical terminological convention. We will define several 
families of subsets of a topological or metric space E] if we denote S the family 
of subsets of E having some property cr, we write S' for the family of nonvoid 
elements of 5; when the space E in which it is defined must be specified, we will 
write S{E) and S'{E). Now for X C E, we will write 5(W) (resp., 5'(X)) for 
the analogous family in the relative topology or metric of X, namely the one of 
subsets (resp., nonvoid subsets) of X having property a in that relative topology 
or metric. 

Let (E,Q) be a topological space; here Q designates the family of open subsets 
of E. Write T for the family of closed subsets of E. For X C E, write X for the 
closure of X. 

We say that E is Ti if for two distinct x,y G E, there is some open set G 
such that X G G but y ^ G\ equivalently, every singleton in E is closed. When E 
is Ti, for every X C E and y G E, the following two statements are equivalent: 
a) for every neighbourhood V of y, V \ {y} contains a point of X; b) for every 
neighbourhood V of y, F\ {?/} contains infinitely many points of X. We say then 
that 2 / is a limit point of X. 

Given a sequence (x„)n,gN hr E and y G E, we say that y is adherent to 
the sequence, if every neighbourhood V of y verifies Xn G V for infinitely many 
values of n. We say that the sequence converges to y, and write Xn —>■ y, if 
for every neighbourhood V of y, there is some ny G N such that for every 
n > nv, Xn G V. 

E is compact if from every covering of if by a family of open sets, one can 
extract a covering by a finite sub-family of those open sets: if if = Gi 
for a family Gi, i G I, of open sets, then there is a finite J C I such that 
E = UjG J ^ sequentially compact if from every covering of if by a countable 
family of open sets, one can extract a covering by a finite sub-family of those 
open sets. The following is well-known: 

— if is sequentially compact iff for every sequence in E there is a point in E 

adherent to it. 

— If if is Ti , if is sequentially compact iff for every infinite X C E, some y G E 

is a limit point of X. 

A subset A of if is compact if the relative topology on X is compact. Every 
finite set is compact, and the intersection of a compact set with a closed set is 
compact. 

Write K. for the family of compact subsets of E. In [2], one writes 7i instead 
of K! , and we followed this convention in [10]. Given Y C A, F is compact in 
the relative topology on A iff it is compact in the topology on if, that is: 

yxcE, ic{x) = ic{E)nv{x) . (3) 

On the other hand, if Y is closed in if, it is closed in the relative topology on A, 
that is 



VAC A, E{X)DE{E)nV{X) , 
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but for X closed the reciprocal holds, so 

VX G T, T{X) = T{E) n V{X) . (4) 

Assume now that {E, d) is a metric space. For r > 0 and x G E we de- 

O 

fine Br{x) and Br{x), the closed ball and the open ball, respectively, of radius r 
centered about x, by 

Br{x) = {y G E \ d{x, y) < r} and Br{x) = {y G E \ d{x, y) < r} . (5) 

Here the open sets are all unions of open balls. 

Given a subset X of E, for every point p G E we define the distance between p 
and X as 

d{p,X) = inf{d(p, cc) | x G X} . (6) 

Note that the distance to X is continuous, because we have 

\d{p,X) - d{q,X)\<d{p,q) . 

For r > 0 we define the dilation of radius r as the map Sr '■ 'P(E) — > V{E) given 
by 

5r{X) = U Br{x) , (7) 

and for r > 0 we define 5°, the open dilation of radius r, by 

5°{X) = y Mx) . (8) 

We have always 

5°r{X) = {pGE\d{p,X)<r} . (9) 

Note that <5° (A), being a union of open balls, is open. On the other hand we 
have in general the inclusion 

5r{X)C{pGE\d{p,X)<r} ■ (10) 

the condition for having equality will be discussed in Subsect. 2.1. However we 
have always: 

{pGE\ d{p,X) < r} = fl <5,(A) = f| 5: (A) . (11) 

s>r s>r 

Note that for A bounded, Sr{X) is also bounded: indeed, if A C Bs{p) for some 
p G E and s > 0, then 6r{X) C 5r{Bs{p)) C Br+sip)- 

A Cauchy sequence is a sequence (x„)„gN in E, such that for any e > 0 there 
is some M G N such that for all m,n> M we have d{xm,Xn) < £■ We say that 
the metric space {E, d) is complete if every Cauchy sequence in E converges to 
some point in E. 

There are several equivalent formulations for compactness of a metric space: 
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Property 1. The following properties are equivalent in a metric space E\ 

1. if is compact. 

2. if is sequentially compact. 

3. For every infinite X C E, some point of if is a limit point of X. 

4. E is complete, and for every r > 0, there is a finite subset X{r) of E such 
that F; = 

5. Every sequence in E contains a subsequence converging to a point in E. 

Given X C E, {X, d) is a metric space, and the metric topology of {X, d) 
coincides with the relative topology on X induced by the metric topology of 
(if, d)\ in other words, the open sets of {X, d) are all sets XC\G, where G ranges 
over the open sets of (E,d). 

2.1 Boundedly Compact, Boundedly Finite, and Proximinal Sets 

Note that every compact subset of a metric space is always bounded and closed. 
We consider here the case where the reverse holds. 

Lemma 1. The following properties are equivalent in a metric space E: 

1. For every p £ E and r > 0, Br{p) is compact. 

2. Every bounded and closed (infinite) subset of E is compact. 

3. For every infinite and bounded X C E, some y £ E is a limit point of X. 

Proof. 1 implies 3. Since X is bounded, we have X C Br{p) for some p £ E 
and r > 0. As Br{p) is compact, by Property 1, some y £ Br{p) is a limit point 
of A. 

3 implies 2. Let Y be bounded and closed. If Y is finite, then Y is trivially 
compact, so we can assume that Y is infinite. Given an infinite A C y, A is 
bounded, and so some y S if is a limit point of A; but then y £ X, and as Y is 
closed, A C y, and hence y £ Y. By Property 1, Y is compact. 

2 implies 1. For every p £ E and r > 0, Br{p) is bounded and closed. □ 

We say then that E is boundedly compact. In [8], one says finitely compact, 
the terminology is borrowed from Busemann [4]. A subset A of if is said to be 
boundedly compact if the metric subspace (A, d) is boundedly compact. 

A subset A of if is called proximinal [3,6] if either A = 0, or for every y ^ X, 
there is some x £ X minimizing the distance to y, in other words 

Vy ^ A, 3x £ X d{y, x) = d{y, X) . 

Write Fbc and Tp for the families of respectively boundedly compact and prox- 
iminal subsets of E. 



Proposition 1. in a metric space E: 

1. ICC CFpCF. 
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2. If E is boundedly compact, then every closed subset of E is boundedly com- 
pact, that is The = Tp = T . 

Proof. 1) If iC is compact, every closed subset of K is compact, so K is boundedly 
compact. Let X be boundedly compact; if X = 0, it is by definition proximinal; 
otherwise for p ^ X we take x G X and set r = d{p,x), then Br{p) n X is 
nonvoid, bounded and closed (relatively to X), so it is compact and there is 
some y G Br{p) r\ X such that 

d{p, y) = mm{d{p, z) \ z G Br{p) C X} = min{d(p, z) \ z G X} , 

that is X is proximinal. If Y is proximinal, either Y = %, which is closed, or 
for p ^ Y we take y G Y such that d{p,y) = d{p,Y), so d{p,Y) > 0, and for 

0 < r < d{p, F), Br{p) C E\Y, and Y is closed. 

2) Let F be a closed subset of E. Given a subset K of E which is bounded 
and closed relative to E, by (4) K is closed in E, Assuming E to be boundedly 
compact, K will be compact in E, so by (3) K is compact relatively to F. 
Hence F is boundedly compact. By item 1, we deduce that The = Tp = T. □ 

Proposition 2. In a metric space E, a set X is proximinal iff for every r > 0 
we have 

Sr{X) = {pGE\d{p,X)<r} . (12) 

In particular, dr{X) is closed. 

Proof. If A = 0, A is proximinal, and both sides of (12) are the empty set. 
Assume now that A yf 0. For r > 0 we have 

(5^(A) = {p G E \ 3x G X, d{p, x) < r} . 

If A is proximinal, for every p G E, there is some x G X such that d{p, x) = 
d{p,X) (indeed, either p G X and we take x = p, or p ^ X and this is true by 
definition). So the above equality becomes (12). If A is not proximinal, there 
exists y ^ X such that for every x G A we have d{y,x) > d{y,X); setting 
r = d{y, A), we have y ^ dffX) but y G {p G E \ d{p, A) < r}. 

Being the inverse image of the closed interval [0,?’] by the continuous map 
A — > R : p I— > d{p,X), {p G E \ d{p,X) < r} is closed. So for A proximinal, 
Sr{X) is closed by (12). □ 

Corollary 1. The following properties are equivalent in a metric space E: 

1. E is boundedly compact. 

2. For every nonvoid compact K and r > 0, dffK) is compact. 

3. For every nonvoid closed F and r > 0, Sr{F) is boundedly compact. 

Proof. 1 implies 3. Let E be boundedly compact, take a nonvoid closed F and 
r > 0; by Proposition 1, F is proximinal, and by Proposition 2, SffF) is closed; 
thus 5r{F) is boundedly compact by Proposition 1. 
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3 implies 2. Let K he a nonvoid compact; as K is closed, 6r{K) is bound- 
edly compact. But K is also bounded, hence 5r{K) is bounded. Being a bounded 
closed subset of a boundedly compact set (itself), 6r{K) is compact. 

2 implies 1. If Sr{K) is compact for every nonvoid compact K and r > 0, 
take p £ E, and as {p} is compact, Br{p) = Sr{{p}) is compact; thus E is 
boundedly compact. □ 

Now we consider a generalization of boundedly compact sets: 

Lemma 2. The following properties are equivalent in a metric space E: 

1. For every p £ E and r > 0, Br{p) is finite. 

2. Every bounded subset of E is finite. 

3. Every bounded subset of E is compact. 

4 . E is boundedly compact, and every subset of E is closed. 

Proof. That 1 implies 2 is trivial. 

2 implies 4- Every bounded and closed subset of E is finite, and hence 

O 

compact; thus E is boundedly compact. For any p £ E, as the balls Br{p) are 

O 

finite for all r > 0, there is some r{p) >0 such that Bj.(p){p) = {p}, so that 
every singleton in E is open; therefore every subset of E is closed. 

4 implies 3. Les X be a bounded subset of E] by hypothesis, X is closed, 
and as E is boundedly compact, X is thus compact. 

3 implies 1. Suppose that for some p £ E and r > 0, Br{p) is infinite. Then 
it contains a sequence (a:„)„gN whose terms are mutually distinct (a:„ yf Xm for 
n yf m). As Br{p) is bounded, it is compact, so by Property 1, this sequence 
contains a subsequence (xt,,^)meN converging to some y £ Br{p). As the 

are pairwise distinct, there is some fc G N such that for m > k, Xi^ yf y. 
Let K = I m > fc}; as iL is bounded, it is compact, so by Property 1 

the sequence {xi^)m>k contains a subsequence converging to some Xi_,, t > k; 
however such a subsequence must converge to y, and y yf x^j, so we have a 
contradiction. Therefore Br{p) must be finite. □ 

We say then that E is boundedly finite. A subset A of if is said to be bound- 
edly finite if the relative topology on X is boundedly finite. Write Ebf for the 
family of boundedly finite subsets of E. Clearly every boundedly finite set is 
boundedly compact (and hence proximinal and closed). A set is boundedly finite 
and compact iff it is finite. Note that if E is boundedly finite, then every subset 
of E is boundedly finite (and hence proximinal and closed). 

2.2 Hausdorff Metric on Compact, Closed, and Proximinal Sets 

Let X and Y be two nonvoid compact subsets of E. The set of d{x, Y) for x S A 
attains a maximum, so we define: 



hd{X, Y) = max{d(x, Y) \ x £ A} , 



(13) 
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which we call the oriented Hausdorff distance from X to Y. Note that for x G X 
the set of d{x, y) for y G Y admits a minimum, and so 

3cr* G X, 3y* G F, d{x*,y*) = d{xffY) = hffX,Y) . (14) 

We define the Hausdorff distance between X and Y as: 

ifd(X,r) =max(hd(X,y),/rd(X,X)) . (15) 

It is well-known [2] that Hd is indeed a distance function on the space tC of 
nonvoid compact subsets of E. 

We have the following characterization [2] of hd and Hd'. 

Property 2. For every X,Y G K.' and for every r > 0: 

hd{X,Y)<r XCSffY), 

Hd{X,Y) <r X <ZSr{Y) and Y <G SffX) . '' '' 



In particular 

— hd{X,Y) is the least r > 0 such that X C Sr(Y). 

— Hd{X, Y) is the least r > 0 such that both X C Sr(Y) and Y C Sr(X). 

In the case where E = R" and d is the Euclidean distance, in the above 
formulas Sr{Y) and 5r-(X) are replaced by the Minkowski additions Y G) Br and 
X (B Br respectively, where Br is the closed ball of radius r centered about 
the origin; such an expression of the Euclidean Hausdorff distance in terms of 
Minkowski additions was considered in [14]. 

A well-known theorem [2] states that: 

Property 3. If (£’,d) is complete, then {IC{E),Hd) is also complete. 

We can extend the Hausdorff metric from IC to T' [1]. Given two nonvoid 
closed sets X and Y , we set 



hd{X, Y) = snp{d{x, Y)\xGX} , (17) 



but here (14) does not hold; now we define the Hausdorff distance Hd{X, Y) as 
in (15). Then Hd is a generalized metric on X; by this we mean that Hd satisfies 
the axioms of a metric, with the only difference that it can take infinite values. 
Nevertheless (IF', Hd) will define a topology in the same way as a metric space. 
Using (11), we see easily that Property 2 becomes here: 

Property 4- For every X,Y G T' and for every r > 0: 



hd)X, Y) < r 
Hd{X,Y)<r 



In particular 



^ca>,<5.(T), 

and 



(18) 
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- hd{X,Y) = inf{r > 0 I X C ^,(F)}. 

— If hd{X, Y) < oo, it is the least r > 0 such that X C ns>r 

- Hd{X, Y) = inf{r > 0 | X C J^(y) and Y C ^^(X)}. 

— If Hd{X, Y) < oo, it is the least r > 0 such that both X C Pls>r and 

rca>,< 5.(x). 

However, from (12) it follows that Property 2 remains true for two nonvoid 
proximinal sets X, Y. 

We have an alternative definition for the Hausdorff distance on X' [1]: 

VX,Fg^', Hd(X,Y) = sup \d(p,X) - d{p,Y)\ . (19) 

peE 

Note that it is also possible to define the Hausdorff distance when one of the 
sets is empty, using formulas (6,17), and setting an empty supremum to 0 and 
an empty infimum to oo: 

wp w hd{0,0) = hd{0,F) = 0 and /id(y,0)=oo; . . 

’ 77^(0, 0) = O and 77^(7^, 0) = oo . ^ > 

Then Property 4 remains true in this particular case. 

2.3 Compact Families of Compact Sets 

Consider a metric space {E,d). Recall the characterization of compact sets in 
a metric space given in Property 1. The set K.'{E) of nonvoid compact subsets 
of the metric space (7f , d ) , provided with the Hausdorff distance 77^ is a metric 
space (X'{E),Hd). We can consider the topology on JC'{E) induced by 77^, and 
define compact sets in it; we write thus IC{IC{E)) for the set of nonvoid compact 
subsets of in other words the set of all nonvoid compact families (for Eld) 

of compact subsets (for d) of E. 

From (3) we deduce that for every X C E, 

/c'(/c'(x)) = /c'(/c'(F))niP(/c'(x)) = /c'(/c'(F))niP(iP(x)) . (21) 

We will generalize to compacts the well-known facts that: a) the union of a 
finite family of finite subsets of E is finite, and b) the set of parts of a finite set 
is finite. The following result was proven in [7]: 

Proposition 3. Eor K. G JC {1C (E)) , [j K. G JC{E). 

Proof. Let (x„)„gN be any sequence in IJ/C. Then for each n G N there is 
some Kn G 1C such that G X„. Since 1C is compact in the metric space 
{IC'{E),Hd), the sequence (X„)„gN contains a subsequence (Xi„)„gN converging 
to X G /C. Since K is compact in {E, d), for each n G N there is some yn G K such 
that d{xi^,K) = d{xi^,y„), and the sequence (yn)ragN contains a subsequence 
(2/jn)™eN converging to y G K. For each n G N we have 



d{xi ^ , y„) = d{xi ^ , K) < hd{Kn, K) < Hd{Kn, K) , 
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and as Hd{K„, K) 0 for n — > oo, we get d{xi^,yn) ^ 0 for n — > oo. Thus 

d{xij^ , Dj^) — > 0 and , y) — > 0 for n ^ cxd, so we deduce that the subsequence 

)™eN converges to y; now y € A" € /C, so y S IJ /C, and we have thus shown 
that any sequence in IJ /C contains a subsequence converging to an element of 
U /C, so IJ /C is compact. □ 

Proposition 4. For K € JC'{E), JC'{K) € K.'{JC'{E)). 

Proof. Let (AT„)„gN be a sequence in JC'{K). Take any e > 0; we will first 
show that there is an infinite sequence Tg in N such that for all n, n' C Tg 
we have Hd{Kn, Kn') < Ser. Since K is compact, there exists a finite number of 
points xi, . . . ,Xm & K such that K C Bg{xi)Li- ■ ■LlBg{xm)- For each n G N and 
j = I, ... ,m we choose Xnj G AT„ as follows: Xnj G KnC\Bg{xj) if KnC\Bg{xj) yf 
0, while Xn,j is any element of AT„ if Kn H Bg{xj) = 0. For AT„ n Bg{xj) yf 0, 
since Xnj G Bg{xj), the triangular inequality implies that Bg{xj) C B 2 g{xn,j), 
so that Kn n Bg{xj) C B 2 g{xn,j)', on the other hand for AT„ n Bg{xj) = 0, we 
have Kn n Bg{xj) C B 2 g{xn,j) anyway. Now since AT„ C K C U • • • U 

Bg{x 

m ), we get 



Kn = {Kn n Bg{xi)) u • • • u (a:„ n Bg{xi)) 

with Kn n Bg{xj) C B 2 g{xn,j) for each j = 1, . . . , m. Hence 

Vn G N, Kn C B 2 g{Xn,l) ^ ^ B 2 g{Xn, 7 n) ■ (22) 

For t = 0, . . . , TO, we obtain by induction an infinite sequence St in N such that 
for 1 < J < f the sequence {Xn,j)neSt converges. Indeed, for t = 0 we have Sq = 
N; given 5*4 for 0 < t < to, since K is compact the sequence {xn,t+i)neSt contains 
a converging subsequence {xn,t+i)neSt+i (where St+i is a subsequence of St), and 
clearly for 1 < j < t, as {xn,j)neSt converges, {xn,j)neSt+i still converges to the 
same limit. Therefore for j = 1, . . . ,to, {xnj)neSm converges, and in particular 
it is a Cauchy sequence. Thus there is an infinite sequence Tg in N, which is a 
“tail” of Sm, such that for j = 1, . . . , to and n, n' G Tg, d{xn,j,Xn',j) < £. By the 
triangular inequality, we get then B 2 g{xn,j) C B^g{xn',j). Then (22) gives: 

Vn, n' G Tg, Kn C B 3 g{Xn',l) U • • • U B3e(a^n'.m) • 

As Xn' , 1 , . ■ . ,Xn' ,m G AT„/ , this means that Kn C 63 g{Kn'); reciprocally, we 
have Kn' C S 3 g{Kn), so that by (16) we have Hd{Kn, Kn') < 3e for all n, n' G Tg. 

For 0 < r] < £, writing Tg = {in | n G N}, the above argument applied to the 
sequence (ATi„)„gN gives an infinite subsequence T^j which is a subsequence of Tg, 
such that Hd{Kn, Kn') < 3y for all n, n' G T^j. Now we build the sequences Ti/^n 
{n > 1) and the infinite sequence (sn)n>i inductively as follows: 

— Si is the least elements of T 1 / 3 . 

— For n > 1, build Ti/ 3 „ from 7i/3(„_i), and s„ is the least m G T^/^n such 

that TO > Sn-l. 
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As is a subsequence of Ti/ 3 „ for u > n, for any n > 1 and u,v > n 

we have s„,s„ € Ti/ 3 „, so that Hd{Ks^, Kg„) < 1/n, and hence (ATs„)n>i is 
a Cauchy sequence in JC'{K). By Property 1, {K,d) is complete, so that by 
Property 3, {IC{K),Hd) is complete, and hence the Cauchy sequence {Ks^)n>i 
converges in JC'{K). 

We have thus shown that from every sequence (A'n)nGN in IC'{K) one can 
extract a subsequence (Ais„)n>i converging for which means thus that IC{K) 
is a compact subset of /C' (if). "Obviously JC'{K) yf 0, since K G JC'{K). Therefore 
K.'{K) G 1C'{V{E)). □ 

A well-known result (see [2], ex. 6.16, p. 33) states that for Ai, A 2 ,Pi,y 2 G 

iJd(AiUA2,FiUy2) <max(iJd(Ai,yi),iid(A2,F2)) • 

We generalize it here: 

Proposition 5. Let r > 0 and X,y G K.'{K.'{E)) such that for every X G X 
there is Yx G y with Hd{X,Yx) < r, and for every Y G y there is Xy G X 
with Hd{Y, Xy) < r. Then i?d(U X,[jy) < r. 

Proof. Note that \jX,[jy G 1C{E) by Proposition 3. For each X G X we 
have Yx G y with E[d{X,Yx) < r, so that X C 6r{Yx) by (16), and as Yx C 
(J y, we have SrfYx) C Sr (U 3^) ; thus every X G X verifies X C (U 3^) i ^'nd we 
deduce that [JX C 5^. ([J 02) . Reciprocally, as for every F G 3^ there is Xy G X 
with Hd{Y,Xy) < r, we deduce that [JF C Sr{[jxy Then Hd[[jX,[jy) < r 
by (16). □ 

3 Hausdorff Sampling into a Boundedly Compact Space 

In [10] we gave a new theory of discretization of compact sets; we choose as 
possible discretization of a nonvoid compact K any nonvoid discrete set S such 
that the Hausdorff distance between S and K is minimal. Formally, we have a 
“Euclidean” metric space (if, d) and a “discrete” subspace D, where % ^ D C E, 
on which we impose only the following two axioms: 

1. D is boundedly finite. 

2. The number Vc = sup^.^^ d{x, D) is finite. 

The number rc is called the covering radius (of D for the distance d). The first 
axiom implies that JC' {D) consists of all nonvoid finite subsets of D] this axiom 
allowed us to show that for K G JC{E), the set 

Mh{K) = {Sg JC'{D) I VT G IC'{D), Hd{K,S) < Hd{K,T)} (23) 

is nonvoid, finite, and closed under union, so that M.h{K) has a greatest element, 
namely 

Ah{K) = [}Mh{K) . 



( 24 ) 
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We called any element of Mh{K) a Hausdorff discretizing set (or Haus- 
dorff discretization) of K, while Ah{K) was called the maximal Hausdorff 
discretization of K. Defining the Hausdorff radius of K (w.r.t. D for the dis- 
tance d) as the nonnegative number rniK) = maxfcg/y d(fc, D), we showed that 
for S G M.h(K), Hd{S,K) = We gave a characterization of Ah{K) 

and of elements of M.h{K) (we will consider it again in the next section). We 
have rniK) < r^, so that the second axiom allows us to have a bound on the 
Hausdorff radiuses rniK) in terms of the resolution of D, of which Vc is a mea- 
sure. 

We will see here that one can loosen the first axiom by requiring that 
1. I? is boundedly compact, 

instead of boundedly finite. Then most results of [10] will remain valid, provided 
that we replace “finite” by “compact” in each statement. We concentrate partic- 
ularly on the results of Subsect. 3.3 of [10]. We will discuss those of Subsect. 3.4 
of [10] in more detail in the next section, devoted to the extension of the theory 
to closed sets. 

First Lemma 13 of [10] becomes: 

Lemma 3. For every x € E and r > 0, Br{x) f] D is compact. More generally, 
for every compact K A E, dffK) f] D is compact. 

Proof. Let K be compact. As K is proximinal, by Proposition 2 SffK) is closed, 
and as K is bounded, SffK) is bounded. Hence 5r{K)PD is, bounded and closed 
in the relative topology of D; as D is boundedly compact, SffK) n D is compact 
in D, and therefore it is compact in E (cfr. (3)). Taking for K the singleton {x}, 
we have SffK) = Bffx), so Bffx) n D is compact. □ 

In particular, taking r = 0, for any K G IC{E) we have K O D compact. 
Under the assumption that D is boundedly finite, we had Corollary 14 of [10], 
which stated that the compact subsets of D coincide with its finite subsets. Here 
such a result does not apply, we simply have by (3): JC{D) = JC{E) n V{D). 

As D is proximinal, we have as in equation (17) of [10] that 

Vx G E, 3p G D, d{x,p) = d{x, D) . 

In particular for x ^ D we have d(x, D) > 0, and since D E, we have > 0. 

Lemma 16 of [10] states that Xc is the least r > 0 such that E = SffD). This 
remains true here, the proof given in [10] remains valid, but this result is in fact 
a consequence of Proposition 2. 

The next result is a generalization of Proposition 18 of [10]: 

Proposition 6. For K G JC{E), 

1. Mh{K) G IC {IC (D)) , in particular A4 h{K) 0; 

2. given X G /C'(/C'(D)) such that X C M.h{,K), we have [jX G Mh{K); in 
particular: 
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— M.h{K) € A4ff(K), and 

— for a closed y C JC'{E) such that y n Mh{K) 7 ^ 0; ^ A4//(iC)) G 

Mh{K); 

3. given a decreasing sequence (5'„)„gN of elements of Mh{K), HnGN G 

Mh{K). 

Proof. 1) Let p £ D and set r = maxk^K d{p, k). Write Z = 6r{K) n D] clearly 
p £ Z, so that Z 0; also Z is compact by Lemma 3; thus Z £ JC'{D). By 
Proposition 4, K.'{Z) £ IC{IC{D)). As the function Hd{-,K) : K.'{D) — > R+ : 
S I— > Hd{S,K) is continuous, it reaches a minimum s on the nonvoid compact 
IC'{Z), that is: a) every S £ 1C{Z) gives Hd{S,K) > s, and b) there exists 
S £ JC{Z) such that Hd{S, K) = s. Take the nonvoid set 

M = {S£lCiZ)\HdiS,K) = s}; 

Clearly {p} G 1C{Z), and so Hd{K,{p}) = ma,Xk ^ k d{p, k) = r (see Property 8 
of [10]); thus s < r. For S G K.'{D), if Hd{S, K) < r, then by (16) we have S C 
Sr{K), so that S C Sr{K)nD = Z and hence S G K.'{Z)\ thus Hd[s,K) > r> s 
for S ^ K.'{Z). Therefore s is the minimum of all Hd{S,K) for S £ K.'{D), and 
M. = Mh{K). Thus Mh{K) yf 0. Now Mh{K) is the inverse image of the 
singleton {s} by the continuous function Hd{-,K) : JC'{D) R+, and so it is 
closed. Thus, being a closed subset of the compact set IC{Z) in the space IC'{D), 
Mh{K) is compact, and as it is nonvoid, we have Mh{K) £ IC{IC{D)). 

2) Let X £ /C'(/C'(D)) such that X C Mh{K). By Proposition 3, IJA G 

JC'{D). For each S £ X we have Hd{S,K) = s, so applying Proposition 5 with 
y = {K}, we get HdifjX ,K) < a. But s is the least Hd{S,K) for S £ JC'{D), 
and we deduce that Hdi\}X ,K) = s, so that [JX £ Mh{K). In particular, 
taking X = Mh{K), we get IJ M.h{K) £ Also, for a closed y C IC{E), 

as Mh{K) is compact in IC{D) (and so in IC{E), see (21)), y n Mh{K) is a 
compact subset of IC{E); if we have also y C Mh{K) yf 0, then y n Mh{K) £ 
JC{JC{E))£,V{K.'{D)) = K.'{JC{D)). Taking A = yf^MH(.K), {J{ynMHiK)) £ 
Mh{K). 

3) Consider a decreasing sequence (5'„)„gN of elements of Mh{K), and let 

S = HnGN each n G N, Eld{Sn,K) = s, and as Sn is proximinal, for 

every x £ K i?s(x) C yf 0; as the Bg{x) C 5'„ (n G N) constitute a decreasing 
sequence of nonvoid closed subsets of the compact Bs{x) C 5'o, by the finite 
intersection property, Bg{x) H S = rinGN(-®s(^) ^ 0- Thus S' yf 0, and so 

S £ JC{D), and as Bs{x) D S 9 for every x £ K, we have hd{K, S) < s. Now 
as S C So, hd{S,K) < hd{So,K) < s. Therefore Hd{S,K) < s, from which we 
conclude that S £ Mh{K)- □ 

Note in particular that any finite subset of M.h{K) is compact; thus 
for Si, . . . , S„ G Mh{K), we have Si U • • • U S„ G M.h{K). 

Now the rest of our theory, in particular the characterization given in Sub- 
sect. 3.4 of [10], remains valid. Note that here the discretization of radius r of AT, 
namely Ar{K) = Sr{K) n D (see (26) below), is compact (instead of finite, as 
in [10]) for every compact K (this follows from Lemma 3). 
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4 Hausdorff Sampling of Closed Sets 

We still assume that D is boundedly compact and that rc < oo. To stress the 
fact that D is not necessarily discrete, we will systematically say “sampling” 
instead of “discretization” or “discretizing set”. We will now define Hausdorff 
sampling for a closed subset of E instead of a compact one. 

For F e E'{E), a Hausdorff sampling of F is any S G T'{D) which minimizes 
the Hausdorff distance Hd{F,S). Here (23) becomes 

Mh{F) = {S G E\D) \ yr G E\D), Hd{F,S)<Hd{F,T)} . (25) 

Note that when F is compact, it is bounded, and so Hd{F, S) < oo implies that S 
is bounded, hence S is compact (because D is boundedly compact). Thus for 
compact sets, (25) is equivalent to (23), in other words the two definitions of a 
Hausdorff sampling coincide. 

The argument of items 1 and 2 of Proposition 6 does not extend naturally 
to the non-compact case. We can only show the following: 

Proposition 7. For F G T'{E), 

1. Mh{F) is closed in the topology of {T' {D), Eld); 

2. given X C M.h{F) such that [jX G F'{E), we have [JX G M.h{F); 

3. given a decreasing sequence (S'„)„gN of elements of A4 h{F), PlnGN ^ 
Mh{F). 

Indeed, item 1 holds because Mh{F) is the inverse image by the continuous 
function Hd{-,F) : T'{D) R+ of the singleton {s}, where s is the infimum of 
all Hd{F, T) for T G T'{D)). Now item 2 is proved in a similar way as in Propo- 
sitions 5 and 6. Finally item 3 has exactly the same proof as in Proposition 6. 

However, we cannot prove by a similar argument that M.h{F) is nonvoid 
and has a greatest element Ah{F). We will thus apply here the method used 
in Subsect. 3.4 of [10], where an explicit formula was given for Ah{F). Having 
such a formula in our case will prove the existence of the maximal Hausdorff 
sampling, and hence Mh{F) will be nonvoid. 

For r > 0, the sampling of radius r [10] is the map A^ : V{E) — > V{D) 
defined by 

VX C E, Ar{X) = 5r{X) C]D = {pGD \ Br{p) n X yf 0} , (26) 

while the open sampling of radius r is the map A° : V{E) —>■ V{D) defined by 

yx C E, A°{X) = 6°{X) nD = {pGD \Br{p) n X 0} . (27) 

Finally, we define the upper sampling of radius r as the map Af : V{E) V{D) 
given by 

At{X) = f| A,{X) . 

s^r 



(28) 
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Lemma 4. For X € V{E) and r >0, 

At{X) = {peD\d{p,X)<r} , (29) 

and this set is closed. For F G Tp{E), Z\+(F) = Ar{F). 

Proof. By (11), 

{peD\ d{p,X) < r} = (f| 5,(X)) nD=f]{Ss{X)nD) = f] As{X) . 

s>r s^r s>r 

So (29) follows. Now this set is closed because it is the inverse image of the closed 
interval [0, r] by the continuous map D R : p d{p, X). 

When F is proximinal, (12) gives: 

{peD\ d{p, F)<r} = 5r{F) nD = Ar{F) . 

Comparing this to (29), we get Af{F) = Ar{F). □ 

We extend the definition of the Hausdorff radius from a compact set to a 
closed one, by replacing the maximum by a supremum in the formula: 

yFGE\E), rniF) = sup d{x, D) = hd{F, D) . (30) 

xGF 

Obviously Tc = rniE) and rniF) < rc for F G E'{E). Lemma 22 of [10] 
remains true: rniF) is the least r > 0 such that F C 6r{D). The proof given 
in [10] remains valid, but this is also a consequence of Proposition 2. 

Now Proposition 23 of [10] becomes: 

Proposition 8. Let F G T'{E) and S G E'{D). Then F[d{F,S) > rniF), and 
for any r > 0 we have: 

— Hd{F, S) <r iff both S C Af{F) and F C Sr{S). 

— Assuming F to he proximinal: E[d{F,S) < r iff both S C Ar{F) and F C 

5r{S). 

Proof. As S C D, we have P[d{F, S) > hd{E, S) > hd{F, D) = rniF). Let r > 0. 
As D is boundedly compact, by Proposition 1 5" is proximinal, so we can apply 
(16): hd(E, S)<riSFC Sr{S). By (29), hd(S, F) < r iS 

SC{pGD\ d{p, F)<r} = Af{F) . 

In the case where F is proximinal, by Lemma 4 this becomes S C Ar{F). 
Since P[d{F,S) < r iff both hd{F,S) < r and hd{S,F) < r, the two items are 
verified. □ 

We get then the following generalization of Proposition 33 of [10]: 

Corollary 2. For F G T'{E) and r > rniF), we have Hd{F, Af{F)') < r 
and Hd[F, Ar{F)^ < r. 
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Proof. Here we have F C 6r{D), and from Proposition 10 of [10] (cfr. the argu- 
ment at the beginning of the second paragraph of the proof of Theorem 24 
of [10]), this means that F C 5r{Ar{F)) . As Ar{F) C Ar{F) C Af{F), 
F C 5r{Ar{F)) C Sr{Af{F)). Now Af{F) is closed (by Lemma 4). Hence 
the result follows by taking in Proposition 8 successively S = Af{F), and 
S=Ar(F). □ 

We can now give the generalization to closed sets of the characterization of 
the maximal Hausdorff discretization and Hausdorff discretizing sets given in 
Theorem 24 of [10]: 

Theorem 1. For F G F'{E), 

1. A'^^^p.^(F) is the greatest Hausdorff sampling of F; 

2. the Hausdorff samplings of F are the closed subsets of whose 

dilation of Hausdorff radius covers F: 

Mh{F) = {Sg F'iD) I 5 C and F C 5,„(;^)(5)}; 

3. the Hausdorff radius of F minimizes the Hausdorff distance beween F and 

nonvoid closed subsets of D: 

rniF) = min{iJ(i^, S') | S € F'{D)}, that is = rniF). 

Note: If F is proximinal, then Aff^f^ps^{F) = A.^^(^p^{F) (by Lemma f). 

Proof By Proposition 8, Hd{F, S) > rniF) for all S G F'{D). 

By Lemma 4, is closed. By Corollary 2, i?d(F, zl+^(^j(S)) < 

rH{F), so by the preceding paragraph we have the equality Hd[F, A'^^^p.^(F)'^ = 
rui^F), and rn^F) is indeed the minimum of all Hd{F,S) for S G F'{D). Thus 
item 3 holds. 

Now for any S S F'{D), we have S G Mh{F) iff Hd{F,S) = rpiF), and 
as Hd{F, S) > rniF) anyway, this holds iff Hd{F, S) < rniF). By Proposition 8 
with r = r//(F), this is equivalent to having both S C A(f^f^p.^{F) and F C 
5rH{F){S). Hence item 2 holds. 

We saw in the second paragraph (proof of item 3) that Hd{F, A~^^^p.^{F)) = 
rniF). Thus A+^(^^(S) e Mh{F), and as S C zl+^^^j(S) for every S G 
Mh{F), so (F) is the greatest element of M.h{.F), and item 1 holds. □ 

In the case where F is proximinal, the above result takes exactly the same 
form as Theorem 24 of [10] in the case of compact sets; the only difference is 
that we consider here S G F'{D) instead of S' G JC'{D). 

For any F G F'{E), we have two possibilities: 

1. there is some p G F such that d{p,D) = rH{F); 

2. for every p G F, d{p,D) < rniF). 

In the first case, we say that F is rp -reached, in the second that F is rp- 
unreached. Note that every compact is r//-reached, because the continuous func- 
tion d{-,D) reaches a maximum on it. 

The following two results illustrate the meaning of this distinction. 
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Proposition 9. Define A : V'{E) 'P(D) by 

A{X) = IJ Ah{K) . 

K^K'(X) 

Then for F G F'{E) we have: 

1. if F is TH-reaehed, A{F) = Aj.^^p'^{F); 

2. if F is rn-unreaehed, a[f) = 

Proof. 1) Take F rjj-reached. For K G K'(F), we have rpiK) < rpiF); as 
Ar{X) is increasing both in r and in X, we get: 

Ah{K) = ArH[K){K) c ArH{F){K) C ArH{F){F) . 

Thus A{F) C A^^(^p^{F). Now let p e -F such that d{p,D) = rH{F). For any 
X G F we have x}) = rniF), and so 

^rH(J’)({4) C A^^(^pj{{p,x}) = Ah{{p,x}) C A{F) . 

But Aru is a dilation, so that 

^rH{F){F) = [J ^rH(F){{x}) , 

xeF 

and the previous equation gives thus Ar^(p'f{F) C A{F). The equality follows. 

2) Take F r//-unreached. For K G KfF), K is r//-reached, so rpiK) < 
rH{F); as Ar{X) C zi°(X) for r < s, and zi° is increasing, we get: 

Ah{K) = C A°^^p^{K) C A°^^p^{F) . 

Thus A{F) C Now for every r < rpiF) there is some p G F such 

that d(p, D) > r. For any x G F we have rp^p, x}) > r, and so 

^r{{x}) C Ar{{p,x}) C Ah{{p,x}) C A{F) . 



But 

Ar{F) = IJ Ar{{x}) , 

xeF 

so the previous equation gives Ar{F) C A{F) for every r < rpiF). From the 

O 

equality Bj.h(f){x) = \Jr<rH{F) ^r(x), we deduce easily from the definitions 
(7, 8, 26, 27) that 

^r„iF)(X)= U Srix) and y Ar(X) , 

r<rjy(-^) r<rH{F) 

so that A°^f^p.^{F) C A(F). The equality follows. □ 
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Proposition 10. For F € 

1- ^rH(F){F) C Mh{F). 

2. If F is r H -unreached, then A°^^p^{F) C Mh{F). 

Proof. Item 1 follows from Corollary 2. Suppose now that F is r//-unreached. 
Thus d{x,D) < rniF) for every x G F. Take x G F and let r = d{x,D); then 

O 

r < rniF) and there is some p G D such that d{x,p) = r; thus x GBri^(F)ip) and 
p G As this is true for any x G F,we have F C SrH{F){^°^(^F'){F)) . 

Hence F C As C applying Proposi- 
tion 8, item 2 holds. □ 

We illustrate in Fig. 3 the distinction between Ar^(_F)(F) and A°^^p.^{F) for 
a r //-unreached closed set F. 

When D is boundedly finite (as was assumed in [10]), every subset of D 
is closed, so that Ar{F) is always closed in Corollary 2, while and 

^°h{f) always closed in Proposition 10, so we don’t need to take the closure 
of these sets. 

Finally, it is possible to extend our theory from F'{E) to F{E), in other 
words to consider Hausdorff sampling for the empty set (up to now, we always 
assumed that F G T'{E)). From (20) we can derive that: 

M^(0) = {0}, Aff(0) = 0, and r^(0) = 0 . (31) 

Then all the above results extend trivially (and 0 is r//-unreached). 

5 Conclusion 

In [10] we presented our theory for the discretization of nonvoid compact sets 
of a metric space {E, d) into a discrete subspace D. The possible discretizations 
of a nonvoid compact K Q E are all nonvoid compact S C D minimizing the 
Hausdorff distance to K. Such discretizations were characterized, and compared 
with the morphological approach using discretization by dilation [8]. 




Fig. 3. Let E = R^, D = Z^, with the Euclidean distance. The unbounded 
curve F is a closed set verifying rniF) = \j2j2, and it is r//-unreached. Filled 
circles designate discrete points in A°^^p^{E), and the hollow circle is a point 
in Aru(F){F) \ A^^^p.^{E) 
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This paper generalizes [10] in two ways. First, the space D is supposed to 
be boundedly compact instead of boundeldy finite, so D is not necessarily “dis- 
crete” . Second, the discretization applies to closed sets, not only non- void com- 
pact sets. 

Apart from the fact that D is boundedly compact, we simply assumed that 
the covering radius is finite (i.e., the distance from a point of E to the nearest 
point of D is bounded). In [10] this assumption was not necessary to prove 
the existence of Hausdorff discretization and to characterize it, but only to give 
a bound on the Hausdorff distance between a compact and its discretization, 
and so to derive the convergence of the discretization to the original compact 
set when the resolution tends to zero. Here in our extension to closed sets, this 
assumption of a finite covering radius is necessary for the existence of a Hausdorff 
discretization, because the Hausdorff distance between two closed sets can be 
infinite. However in practice one usually takes E = R" and D = nr=i(p*z) 
(where pi is the resolution or grid spacing along the *-th axis), so here the 
covering radius is finite. 

This paper is part of an ongoing project [10,11] to analyse the discretization 
of objects (sets or images), and of operators transforming objects, in terms of the 
Hausdorff distance. In a next paper [12], we will extend the analysis in [10] of the 
Hausdorff metric properties of the morphological discretization by dilation [8], 
and consider in detail the topologies associated to it. 
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Abstract. Abstract cell complexes (ACC’s) were introduced by Ko- 
valevsky as a means of solving certain connectivity paradoxes in graph- 
theoretic digital topology, and to this extent provide an improved the- 
oretical basis for image analysis. We argue that ACC’s are a very nat- 
ural setting for digital convexity, to the extent that their use permits 
simple, almost trivial formulations of major convexity results such as 
Caratheodory’s, Helly’s and Radon’s theorems. ACC’s also permit the 
use in digital geometry of axiomatic combinatorial geometries such as 
oriented matroids. We give a brief indication of how standard convex- 
ity algorithms from computational geometry applied to the points of an 
ACC can form a substantial part of digital convexity algorithms. 



1 Introduction 

Abstract cell complexes (ACC’s) were introduced by Kovalevsky [8] as a means 
of solving certain connectivity paradoxes in graph-theoretic digital topology, 
and to this extent provide an improved theoretical basis for image analysis. 
A very similar approach is that of Khalimsky [4]. As a datatype for digital 
geometry, however, it seems that ACC’s have hardly been considered, apart from 
by Kovalevsky [9]. (The basic definitions in this work are different to those in [9].) 
In this work we argue that ACC’s are a very natural theoretical setting for digital 
geometry, to the extent that they permit simple, almost trivial formulations of 
major convexity results such as Caratheodory’s, Helly’s and Radon’s theorems. 

ACC’s are non-homogeneous digital spaces in that they contain elements 
of different dimension, including both points (0-cells) and pixels (2-cells). We 
first develop a geometry of points of an ACC, which is nothing other than an 
oriented matroid structure.^ The geometry of cells, or “digital geometry”, is then 
constructed out of the point geometry. Computation of the point convex hull 
operation can be done using standard algorithms from computational geometry, 
and these algorithms can then play a substantial role in the computation of 
digital convex hulls. 

This work is part of an overall project to develop axiomatically defined dig- 
ital geometric spaces. The need for work on the mathematical foundations of 

* This work has been supported by the EPSRC project “Digital Topology and Geom- 
etry: an Axiomatic Approach with Applications to GIS and Spatial Reasoning.” 

^ No knowledge of oriented matroids is assumed here, but the standard reference is [1] 



G. Bertrand et al. (Eds.): Digital and Image Geometry, LNCS 2243, pp. 272—282, 2001. 
(c) Springer-Verlag Berlin Heidelberg 2001 
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digital geometry has been argued for by, among others, Frangon [2], Klette [6], 
Smyth [10], Voss [11]. One of the main intended applications (and indeed mo- 
tivations) of axiomatic digital geometry is the development of robust geometric 
algorithms. It is a fundamental issue in computational geometry that R" is often 
the wrong theoretical medium with which to describe the behaviour of geometric 
algorithms, in that the mismatch between the infinite precision of R" and the 
finite precision of computation can cause rounding errors. Our overall aim is to 
replace R" by equally valid - meaning, roughly, axiomatically defined spaces that 
have a geometry rich enough to describe and reason about geometric computa- 
tion - but discrete geometric spaces. The axiomatic foundations of computational 
geometry is the subject of Knuth’s monograph [7]. Knuth’s “CC systems” are a 
special class of oriented matroids. 

For simplicity we will only present a theory of 2-dimensional digital spaces 
here. The theory does generalize quite smoothly to arbitrary dimensions, how- 
ever, and this work is presented in [12]. 

2 Digital Planes 

For any n > 1, let /„ denote the set {0, . . . , n}. The following definition is a 
model of Kovalevsky’s axioms [9] . 

Definition 1. A 2-dimensional Cartesian ACC consists of a point set In x Im 
together with the following subsets, called digital cells.' 

— 0-cells are the singleton subsets; 

— 1-cells are the subsets {(x, y), (x, y -\- 1)} and {(x, y), (x -I- 1, y)}; 

— 2-cells are the subsets {(x, y), (x, y -\- 1), (x -I- 1, y), (x -I- 1, y -I- 1)}. 

We will here call such structures digital planes. When the context is clearly 
that of a digital plane, digital cells will often be referred to simply as cells. The 
definition is in fact a description of a particular model of Kovalevsky’s axioms [9] , 
one which we find convenient to use here. Our choice of description is based on 
the intuition that an ACC is an abstract version of the classical notion of a cell 
complex in combinatorial topology. We want to think of a cell as a polytope in 
an abstract point geometry, and an ACC as a tiling of cells. 

To see how the definition formally satisfies the ACC axioms, let T> denote 
the set of digital cells in a digital plane, and for any n-cell D let dim{D) = n. 
Then, where C denotes the relation of proper subset inclusion on T), clearly 
(D, C, dim) is an ACC exactly according to the definition in [8]. It is also very 
straightforward to check that it is a 2-dimensional Cartesian ACC according 
to [9] , although our coordinate system is different to the one considered in that 
work. We choose to think of only the 0-cells as having coordinates, as these are 
the only digital cells that represent points rather than subsets of the Euclidean 
plane. Moreover, Kovalevsky’s definitions of digital half-spaces and convex digital 
sets are expressed in terms of his choice of coordinates, but our definitions are 
different. 
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A cell C is a face of a cell D if C C D. A vertex of a cell is any of its faces 
that is a 0-cell. A digital set is any set of cells. By abuse of terminology we will 
often identify 0-cells with points. 

The points in a digital plane will be considered as points in K.^ in the obvious 
way. Digital cells are then considered as indexing subsets of as follows. We 
say that the continuous part of a digital cell D is the relative interior of the 
poly tope in whose vertices are the vertices of D. In other words: 

— The continuous part of the 0-cell {p} is {p} 

— The continuous part of the 1-cell {p, q} is the open interval in between 
p,q; 

— The continuous part of the 2-cell {p, q, r, s} is the interior in R^ of the filled-in 
square with vertices p, q, r, s. 

Definition 2. In any digital plane, the digital image o/ P C R^ is the set of all 
digital cells whose continuous part intersects P. 



Example 3. The diagram on the right shows the digital image of the shaded 
Euclidean set on the left. The diagram is supposed to represent a collection of 
digital cells, but it is far clearer to draw a digital cell as its continuous part, and 
we use this convention throughout. 
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That each point in a digital plane has integer coordinates allows for robust 
computation, but might seem unnecessarily restrictive in that it appears to fix 
the resolution. This is not so, however, as resolution can be adjusted using a 
simple scaling operation. Implicit in the definitions above is the embedding x i— > 
X of the point set A of a digital plane into R^, but for a different resolution we 
consider an embedding e : if — > R^, x i— > Ax. We then think of the continuous 
part of a digital cell D as the relative interior of the polytope with vertices e{D), 
and digital images are defined accordingly. All the results concerning digital 
images given here are true for whatever resolution is considered; a more formal 
account is given in [12]. 



2.1 Point Geometry 

In this section we forget about the cells in a digital plane and consider only the 
points. The point geometry of a digital plane is simply the geometry its point 
set E inherits as a subset of R^. We say that: 
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— A straight line in E is the intersection with E of any straight line L in 
that contains > 2 points of E] 

— The open half-planes determined by a straight line Lf]E are the intersections 
with E of the open half-planes determined by L in R^. 

This construction gives the structure of an oriented matroid on E (straight lines 
are hyperplanes and pairs of open half-planes are cocircuit orientations). The 
point geometry can therefore be regarded as a highly developed axiomatic struc- 
ture in combinatorics. The point geometry is also very close to being a “CC- 
System” introduced by Knuth [7], except that Knuth does not allow triples of 
collinear points. 

The open half-planes determined by a straight line L C E are de- 
noted L+jL”. A closed half -plane is the union of a straight line with either 
of its open half-planes. The closed half-plane L U T+ is denoted The con- 
vex hull, denoted [T], of a point set T C E is the intersection of all the closed 
half-planes in E that contain T. Let [T]r denote the convex hull of T in R^. The 
following result is a corollary of Proposition 10. 

Proposition 4. [T] = [T]r n E. 

2.2 Digital Geometry 

The digital geometry of a digital plane is constructed entirely in terms of its point 
geometry. We will show that this geometry then reflects Euclidean geometry to 
the extent that digital convex sets are digital images of Euclidean convex sets. 
Once this correspondence has been established it is almost trivial to prove digital 
versions of major convexity theorems. 

To distinguish clearly between the point and the digital geometries of a digital 
plane, we will sometimes speak of point half-planes, convex point sets for the point 
geometry, and digital half-planes, convex digital sets for the digital geometry. To 
distinguish between these definitions and those of classical geometry, we often 
refer to half-planes, convex sets in R^ as Euclidean half-planes, convex Euclidean 
sets. 

Definition 5. The closed digital half-plane determined by a closed point half- 
plane is the set of all digital cells D such that D f- L or D f] yf 0. 

This definition and our subsequent definition of “digital convex hull” are 
different to those in [9]. 

Example 6. The diagram on the left shows a straight line (the set of crosses) 
and one of its open half-planes. The diagram on the right shows the determined 
closed digital half-plane. 
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From the diagram on the right we can see, informally, that the closed digital 
half-plane is the digital image of the corresponding closed Euclidean half-plane. 
Closed digital half-planes could therefore have been defined as digital images of 
(certain) closed Euclidean half-planes, but we want to emphasize that once the 
point geometry of a digital plane has been set up, the digital geometry can be 
constructed entirely in terms of this point geometry, with no further reliance on 
Euclidean space. 

Closed point half-spaces are convex point sets that are sufficiently well- 
behaved to allow their digital structure to be derived directly from their point 
structure. But not all convex sets have this property: 

Example 7. The set of dark points in the diagram is a convex point set. We want 
the corresponding digital convex set to be the digital image of the Euclidean 
convex hull. But that the 2-cell drawn lies in this image cannot be ascertained 
merely by consideration of its vertices, as none of these lie in the convex point 
set. 
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The example is meant to motivate the following definition, in which we use 
the abuse (which comes from identifying 0-cells with points) that a point set T 
is a subset of a digital set T> if, for each x GT, the 0-cell {x} G T>. 

Definition 8. The digital convex hull of a point set T is the intersection of all 
the closed digital half-planes that contain T. 

Example 9. The diagram shows the intersection of three closed digital half- 
planes, which is the digital convex hull of the point set {a,b,c} together with 
the 2-cell C. This cell must be separated from the digital convex hull using a 
further closed half-plane. 
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The point of the example is to illustrate the sort of problem that may arise 
in developing a theory of digital convexity. Using Proposition 4 is it simple to 
prove that the point convex hull operation has the basic, fundamental properties 
of a convex hull operation in axiomatic convexity theory - see [3], for example. 
In particular, convex hulls and convex sets are interchangeable: a point set T 
is convex if T = [T], and the point convex hull of a set is then the smallest 
convex set that contains it. Digital convexity, on the other hand, is not so well- 
behaved, as the collection of convex digital sets is not even going to be closed 
under intersections. The intersection of the three closed digital half-planes in the 
example is not the digital image of any Euclidean convex set, and so is probably 
not what we would want to count as a convex digital set. 



Proposition 10. The digital convex hull of a point set T is the digital image of 
[T]m. 

Proof. Let be any closed point half-plane, and let be the closed Eu- 
clidean half-plane such that L = M C\ E and L+ = M+ n E, where E is the 
point set of the digital plane in question. Let D be any digital cell and let C 
denote its continuous part. From the theory of Euclidean support hyperplanes 
- see [13], for example - we have that C n = 0 if and only if D C 
and D ^ M . The closed digital half-plane determined by LP'^ is therefore the 
digital image of M°+. It follows easily that the digital image of [T]r is contained 
in the digital convex hull of T. For the converse, suppose D is not in the digital 
image of [T]r. Say that a closed Euclidean half-space is an E-half-space 
if M contains > 2 points of E. To show that D is not in the digital convex hull 
of T we need to find an E-half-space that contains T such that C C M~ . 
It is simple to prove (by considering its facets) that [T]r is the intersection of 
a collection of E-half-spaces. Now C and [T]r are disjoint, so 

by Helly’s theorem (which says that if the intersection of a collection of con- 
vex sets in the Euclidean plane is empty, then the intersection of some three 
of them is empty), there exist i,j such that C n n = 0. If either 
c n M°+ = 0 or C n M°+ = 0 we are done, which is so when D is a 0-cell. 
Assuming otherwise, if D is a 1-cell then there is some E-half-space parallel to 
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the straight line through D that suffices. If D is a 2-cell then there exist E- 
half-spaces N^~ , . . . , iV® ~ such that C is the intersection of , • ■ ■ , . Then 

by Helly’s theorem there is some I such that Nf n n = 0, so then 
T C N°+. 

The result but not the proof given here generalizes to higher dimensions. A 
more coherent proof (which there is not room for here) of the n-dimensional 
result is given in [12]. 

Proposition 11 (Caratheodory’s theorem). For any point set T, [T] is the 

union of the eonvex hulls of the subsets of T having eardinality < 3, and the 
digital eonvex hull of T is the union of digital eonvex hulls of such subsets. 

Proof. Given Caratheodory’s theorem on the first part of the result follows 
easily from Proposition 4 and the second from Proposition 10. 

Proposition 12 (Radon’s theorem). Any point set T that contains > 4 
points admits a partition into two sets whose respective digital convex hulls in- 
tersect. 

Proof. By Radon’s theorem for R^, T can be partitioned into two sets Ti,T 2 
whose respective Euclidean convex hulls intersect. For any point x that lies in 
this intersection, let D be the digital cell whose continuous part contains x. 
Then D lies in the digital convex hulls of both Ti, T 2 . 

Proposition 13 (Helly’s theorem). Let T be any collection of convex point 
sets. If the intersection of any three members ofT is non-empty then the inter- 
section of all the digital convex hulls of members of P is non-empty. 

Proof. By Helly’s theorem for R^, the intersection of the collection of Euclidean 
convex hulls of members of T is non-empty. For any point x that lies in this 
intersection, let D be the digital cell whose continuous part contains x. Then D 
lies in the intersection of all the digital convex hulls of members of T. 

The following example indicates the advantage of ACC’s over graphs with 
regard to Helly’s theorem. 

Example I 4 . In graph-theoretic digital topology the digital plane is composed 
entirely of pixels. A pixel is in effect 2-dimensional, and for digitization purposes 
is regarded as a square in the Euclidean plane. The diagram on the left shows the 
digital image^ of the filled-in Euclidean triangle with vertices (0, 0), (0, 4), (4, 0). 
This image is a digital convex set according to any of the standard definitions 
discussed, for example, in [-5]. Rotate the image through 90 degrees successively 
to get four digital convex sets: the intersection of any three of these is non-empty, 

^ The digital image being the set of all squares (pixels) whose interior intersects the 
triangle. There are of course several different “digitization” definitions in the liter- 
ature, but the point is that the digital triangle is uncontroversially a convex set in 
graph-theoretic digital topology. 
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but no pixel lies in the intersection of all four. The diagram on the right shows 
the digital image in an ACC of the same filled-in triangle, and the intersection 
of the four rotations contains the middle point. 
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3 Computation of Digital Convex Hulls 

In this section we give a brief indication of how a digital convex hull algo- 
rithm can be derived from the convex hull algorithm in computational geom- 
etry. We only go so far as to outline a computation of whether a digital cell 
is in the digital convex hull of a point set; our aim here is merely to indi- 
cate a potential tie-up between computational and digital geometry. There is 
perhaps a potential parallel to be made between this tie-up and the work of 
Winter [14] of using ACC’s in geographical information systems to resolve the 
so-called “raster/ vector” (pixel/point) debate. 

We begin with digital straight lines. Let p, q be distinct points of a digital 
plane and let L be the unique point straight line that contains them. We say that 
the digital line through p, q is the intersection of the digital closed half-spaces 
determined by , L^~ , which is the set of all digital cells D such that D C L 
or D intersects both L+, L~ . 

Proposition 15. The digital line determined by p, q is the digital image of the 
straight line through p,q in 

Proof. ^From Proposition 10 we have that the digital image of the Euclidean 
line through p,q is a. subset of the digital line through p, q. Conversely, if D is a 
digital cell in the digital line through p, q then its continuous part C is either a 
subset of the Euclidean line through p, q or intersects both the half-spaces of the 
Euclidean line. In the latter case C then intersects the Euclidean line because C 
is convex. 

Computation of whether or not a digital cell D lies on the digital line through 
p, q therefore amounts to computation of which side of L each of its vertices lies. 
A standard method (used, for example, in [7]) is calculation of the determinant 
of the relevant matrix: where p = {xi,y\) and q = (0:2, 2/2), on which side of L 
the point r = (0:3, 2/3) lies is given by the sign of the determinant of 
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( xi yi l\ 

a ;2 2/2 1 

2^3 2/3 1/ 

i.e. L,L~^,L~ are the sets of points for which the determinant is, respectively, 
zero, positive, negative. 

We say that the digital straight line segment between p, q is the digital convex 
hull of the set {p, q}. Clearly, this is the set of all digital cells that lie on the digital 
straight line through p,q each of whose vertices r is such that min{xi, X 2 } < 
X 3 < maa;{xi,a:2} and min{yi,y 2 } < 2/3 < m-ax{2/i, 2 / 2 }- 

Digital line segments can be used in a simple computation of whether a digital 
cell D lies in the digital convex hull of a point set T. It is a standard result that 
[T]r is the disjoint union of its topological interior and its topological boundary. 

Proposition 16. A digital cell D is in the digital image of [T]r if and only if 
it satisfies at least one of the conditions: 

1. Some vertex of D lies in the interior of [T]r; 

2. Every vertex of D lies in the boundary of [T]r; 

3. Some point in the continuous part of D lies in the boundary of [T]r. 

Proof. (^): If D has a vertex in the interior of [T]r then there is some neigh- 
bourhood of the vertex that is a subset of [T]r, and this neighbourhood intersects 
the continuous part of D. If every vertex of D is in [T]r then its continuous part 
is a subset of [T]r. (^): If D is in the digital image of [T]r then the continuous 
part of D intersects the interior or the boundary. If it does not intersect the 
boundary then it is a subset of the interior, and then every vertex of D is in 
[T]r. 

The boundary of [T]r can be computed using a convex hull algorithm from 
computational geometry. We refer to Knuth’s convex hull algorithm [7], which 
in effect will, given input T, return the list ui, . . . = v\ of vertices of 

[T]r such that, for each i, the straight line segment between Vi,Vi+i is a facet of 
[T]r. Each Vi G T, and the boundary of [T]r is the union of its facets, so whether 
a digital cell satisfies the third condition of the result amounts to whether it lies 
on the digital straight line segment between some Vi,Vi+i. 

The second condition in the result can also be determined by consideration 
of digital straight line segments between the vertices Vi,Vi+i. 

Whether a vertex x of a digital cell lies in the interior of [T]r amounts, 
for example, to whether, for all i, x lies on the same side of the point line 
through Vi,Vi-f-i as each of the other vertices vj, which can be computed using 
determinants as above. 

4 Conclusions and Further Work 

We have argued that ACC’s are a natural setting for digital convexity in that 
they allow simple digital formulations of classical convexity theorems and allow 
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a link between digital geometry and existing axiomatic finite point geometries 
such as oriented matroids. We have given a brief indication of how computation 
in digital geometry can be derived from standard methods in computational 
geometry. 

This work is really only the beginning of a theory of digital convexity for 
ACC’s. Our version of Helly’s theorem could perhaps be strengthened (although 
it is analogous to the version of Helly’s theorem for oriented matroids - see 
Theorem 9.2.1 in [1]); we don’t know whether, for example, the following stronger 
result holds. 

Proposition 17. Let S be any collection of point sets in a digital plane such 
that the intersection of the digital convex hulls of any three is non-empty. Then 
the intersection of the digital convex hulls of all the sets in S is non-empty. 

At any rate, for general theory of computation over R" we need to consider 
digital convex hulls of sets of cells rather than just sets of points as considered 
here, which we illustrate with the following example. Let p,q be distinct points 
of a digital plane and suppose we want to determine whether a point x C lies 
on the Euclidean straight line segment L between them. It is well-known that 
this cannot necessarily be decided using any finite number of decimal places of x, 
so computation of the answer might not be robust. According to our definition of 
digital images, the point x is represented by the digital cell D whose continuous 
part contains x. The question therefore amounts, in this context, to whether D 
lies in the digital straight line segment between p, q. Uncertainty of computation 
is then expressed by saying that if D is not on this line segment then definitely 
X ^ L, and if D is on this line segment then possibly x G L. But we have not 
in this work considered the more general case of when p, q are arbitrary points 
of M^, to be represented by digital cells T>p, Dq that are not necessarily 0-cells. 
In this case the question amounts to whether D lies on the “digital convex hull” 
of Dp,Dq, which is an operation not defined here. One possible definition is 
to consider the union of the sets of vertices of Dp,Dq and consider the digital 
convex hull of this point set. 
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Abstract. One possible definition of the length of a digitized curve in 
3D is the length of the shortest polygonal curve lying entirely in a cube 
curve. In earlier work the authors proposed an iterative algorithm for the 
calculation of this minimal length polygonal curve (MLP). This paper 
reviews the algorithm and suggests methods to speed it up by reducing 
the set of possible locations of vertices of the MLP, or by directly calcu- 
lating MLP-vertices in specific situations. Altogether, the paper suggests 
an in-depth analysis of cube curves. 



1 Introduction 

The analysis of digital curves in 3D space is of increasing practical relevance 
in volumetric image data analysis, see cited examples of applications in [5,6]. 
A digital curve is the result of a process (path on a 3D surface, 3D thinning 
etc.) which maps captured ‘curve-like’ objects into well-defined digital curves 
(see definition below) . The length of such a simple digital curve may be defined 
based on local chain-code configurations in digital space [6] , or on (global) curve 
approximations in Euclidean space. The latter approach has the potential of 
multi-grid convergence of estimated length toward the true length assuming that 
a digitization model such as cube intersection [6] is used for obtaining test data 
by digitizing curves of known length. For global length estimation algorithms 
there are (at least) two options, the use of 

1. the length of a 3D NSS (naive straight segment) approximation [5] of a 
26-connected curve, or 

2. the length of the 3D MLP (minimum-length polygonal) curve fully contained 
and complete in the tube of a simple 6-connected closed curve [9,10]. 

In earlier work the authors started with studying an algorithmic solution for the 
3D MLP approach and proved that the only possible positions of MLP-vertices 
are the so-called critical edges which are incident with three cubes of the simple 
cube-curve [8] , and presented an iterative algorithm for approximating the MLP 
of a simple cube-curve in 3D [1]. It is expected [5] that the length estimation 
behavior of 3D NSS compares to 3D MLP similarly as that of DSS to MLP 
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in the two-dimensional case [7]. However, to support such experiments we have 
to continue to improve the time complexity of the 3D MLP approximation as 
specified so far in [ 1 ] for making statistically relevant performance evaluations 
feasible. 

The difficulty of the subject may be illustrated by the fact that the Eu- 
clidean shortest path problem (given a finite collection of polyhedral obstacles 
in 3D space, and a source and a target point, find a shortest obstacle-avoiding 
path from source to target) is known to be NP-hard [3]. However, there are 
polynomial-time algorithms solving the approximate Euclidean shortest path 
problem in 3D, see [4]. Our iterative algorithm in [7] is not yet known to be 
always convergent to the exact 3D MLP, or whether it may only approximate in 
well-defined cases, up to some error etc. the correct MLP. All our experiments 
so far suggest that the algorithm presented in [ 1 ] is always convergent to the 
correct 3D MLP, and time measurements also support the hypothesis that its 
run-time behavior is asymptotically linear in the number of input cubes even if 
a very small threshold is used for termination of the algorithm. 

In this paper we review this algorithm and discuss three ways to speed it 
up: 1) Not all critical edges contain MLP-vertices. We identify a subset of these 
irrelevant critical edges which can then be excluded from further calculation. 2 ) 
Certain critical edges can be identified which can only contain MLP-vertices - 
if at all - at their end-points. 3) The positions of MLP-vertices belonging to 
fiat arcs of the MLP can under certain conditions be calculated in closed form. 
Therefore the present paper may also be seen as a step towards an in-depth 
analysis of the geometry of 3D MLP’s in simple cube curves. 

Although we are not able yet to provide a closed form solution for a 3D MLP 
algorithm of a given simple cube-curve, these three steps lead in this direction 
by replacing parts of the iterative procedure by direct computation. 

In the following section the basic notions are introduced. We define the length 
of a simple cube-curve. In Section 3 we summarize our previous algorithm for 
the calculation of this length. In Section 4 the three items mentioned above are 
elaborated. 

2 The Length of Simple Cube Curves 

We start with the definition of simple cube curves, see Fig. 1 for two examples. 
Any grid point (i,j,k) G is assumed to be the center point of a grid cube 
with faces parallel to the coordinate planes, with edges of length 1 , and vertices 
at its corners. Cells are either cubes, faces, edges or vertices. The intersection of 
two cells is either empty or a joint side of both cells. A cube-curve is a sequence 
9 = (/o: Co, /i, Cl, ...,/„, c„) of faces fi and cubes Ci, for 0 < i < n, such that 
faces fi and fi+\ are sides of cube c^, for Q < i <n and fn+i = /o- K is simple 
iff n > 4, and for any two cubes Ci, Ck in g with |* — fc| > 2 {mod n) it holds 
that if Ci n Cfc yf 0 then either \i — k\ =2 {mod n) and Ci fl Ck is an edge, or 
\i — k\ = 3 {mod n) and ct D Ck is a vertex. A tube g is the union of all cubes 
contained in a cube-curve 5 . It is a polyhedrally-bounded compact set in 
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Fig. 1. Two cube-curves in 3D space 



and it is homeomorphic with a torus in case of a simple cube-curve. The cube- 
curve on the left of Fig. 1 is simple, and the cube-curve on the right is not. 
Analogously, edge-curves or face-curves may be defined in 3D space. This paper 
deals exclusively with simple cube-curves. A curve V in 3D Euclidean space is 
complete in g iff it has a non-empty intersection with any cube contained in g. 
Following [9,10], the length of a simple cube-curve g is defined to be the length 
1{V) of a shortest polygonal simple curve V which is contained and complete 
in tube g. A simple cube-curve g is flat iff the center (grid) points of all cubes 
contained in g are in one plane parallel to one of the coordinate planes. 

A non-flat simple cube-curve in specifies exactly one minimum-length 
polygonal simple curve (MLP) which is contained and complete in its tube [10]. 
The MLP is not uniquely specified in flat simple cube-curves. Flat simple cube- 
curves may be treated as square-curves in the plane, and square-curves in the 
plane are extensively studied, see, e.g. [7]. It seems there is no straightforward 
approach to extend known 2D MLP algorithms to the 3D case. 

3 The Iterative Algorithm 

This section contains fundamentals used in our algorithm presented in the con- 
ference paper [1] for calculating the length of a simple cube-curve. Let g be a 
simple cube-curve, and V = {po,Pi^ ■■■, Pm) be a polygonal curve complete and 
contained in g, with po = pm- 

Lemma 1. It holds m > 3 for any polygon V = (po,pi, ...,pm) complete and 
contained in a simple cube-curve. 

The case to = 3 is possible. During a traversal along the curve V we leave cubes, 
and we enter cubes. The traversal is defined by the starting vertex po of the curve 
and the given orientation. Let Cp = (cq, ci, ..., c„) be the sequence of cubes in the 
order how they are entered during this curve traversal. Because V is complete 
and contained in g it follows that Cp contains all cubes of g, and no further 
cubes are in g. 
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Lemma 2. For an MLP V of a simple cube-curve g it holds that C-p contains 
each cube of g just once. 

Now we consider a special transformation of polygonal curves. Let V = 
{po,Pi, Pm) be a polygonal curve contained in a tube g. A polygonal curve Q 
is a ^-transform of 7^ iff Q may be obtained from 7^ by a finite number of steps, 
where each step is a replacement of a triple a, 6, c of vertices by a polygonal se- 
quence a,bi, ...,bk,c such that the polygonal sequence a, bi, ...,bk,c is contained 
in the same set of cubes of g as the polygonal sequence a, 6, c. The case k = 0 
characterizes the deletion of vertex b, the case k = 1 characterizes a move of 
vertex b within g, and cases k > 2 specify a replacement of two straight line 
segments by a sequence of /c -I- 1 straight line segments, all contained in g. 

Lemma 3. Let V be a polygonal curve complete and contained in the tube g of 
a simple cube-curve g such that Cp is without repetitions of cells. Then it holds 
that any ^-transform of V is also complete and contained in g. 



An edge contained in a tube g is critical iff this edge is the intersection of 
three cubes contained in the cube-curve g. Figure 2 illustrates all critical edges 
of the cube-curves shown in Fig. 1. Note that simple cube-curves may only have 
edges contained in three cubes at most. For example, the cube-curve consisting 
of four cubes only (note: there is one edge contained in four cubes in this case) 
was excluded by the constraint n > 4. Based on these lemmata it was possible 
to prove the following theorem [8]: 

Theorem 1. Let g be a simple cube-curve. Critical edges are the only possible 
locations of vertices of a shortest polygonal simple curve contained and complete 
in tube g. 

Note that this theorem also covers flat simple cube-curves with a straightforward 
corollary about the only possible locations of MLP vertices within a simple 
square-curve in the plane: such vertices may be convex vertices of the inner 




/] ^ 71 71 71 71 71 



Fig. 2. Critical edges of two cube-curves 
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Fig. 3. Curve initializations (‘clockwise’) 



frontier or concave vertices of the outer frontier only because these are the only 
vertices incident with three squares of a simple square-curve. 

Our algorithm is based on the following model: Assume a rubber band is laid 
through the tube g. Letting it move freely it will contract to the MLP which 
is contained and complete in g. The algorithm consists of two subprocesses: at 
first (A) an initialization process defining a simple polygonal curve Vq contained 
and complete in the given tube g and such that C-pp contains each cube of g just 
once (see Lemma 2), and second (B) an iterative process (a g-transform, see 
Lemma 3) where each completed run transforms Vt into Vt+i with l(Vt) > 
l{Vt+i), for f > 0. Thus the obtained polygonal curve is also complete and 
contained in g. 

(A) The initial polygonal curve will only connect vertices which are end 
points of consecutive critical edges. For curve initialization, we scan the given 
curve until the first pair (eo,ei) of consecutive critical edges is found which 
are not parallel or, if parallel, not in the same grid layer (see Fig. 2 (right) 
for a non-simple cube-curve showing that searching for a pair of non-coplanar 
edges would be insufficient in this case). For such a pair (eo,ei) we start with 
vertices (po,Pi), Po bounds cq and pi bounds ei, specifying a line segment poPi of 
minimum length (note that such a pair (pojPi) is not always uniquely defined). 
This is the first line segment of the desired initial polygonal curve Vq. 

Now assume that Pi-iPi is the last line segment on this curve Vg specified 
so far, and pt is a vertex which bounds ep Then there is a uniquely specified 
vertex Pi+i on the following critical edge e^+i such that piPi+i is of minimum 
length. Length zero is possible with Pi+i = pi . In this case we skip Pi+i, i.e. 
we do not increase the value of i. Note that this line segment piPi^i will always 
be included in the given tube because the centers of all cubes between two 
consecutive critical edges are collinear. 

The process stops by connecting on edge e„ with po (note that it is possible 
that a minimum-distance criterion for this final step may actually prefer a line 
between and the second vertex bounding cq, i.e. not po). This initialization 
process calculates a polygonal curve Vq which is always contained and complete 
in the given tube. 
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Fig. 4 . Intersection points with edges 



(B) In this iterative procedure we move pointers addressing three consecutive 
vertices of the (so far) calculated polygonal curve around the curve, until a 
completed run t + 1 does only lead to an improvement which is below an a- 
priori threshold r, i.e. l{Vt) — t < l{Vt+i)- In all our experiments the algorithm 
converges fast for a practically reasonable value of r. 

Assume a polygonal curve Vt = {po,pi, ...,pm), and three pointers addressing 
vertices at positions i — 1, i, and i + 1 in this curve. There are three different 
options that may occur which define a specific g-transform. 

(Oi) Point Pi can be deleted iff pi-ipi+i is a line segment within the tube. 
Then subsequence (pi-i,pi,pi+i) is replaced in our curve by (pi-i,pi+i). In 
this case we continue with vertices pi-i,pi+i,pi +2 ■ (O2) The closed triangular 
region A{pi-ipiPi^i) intersects more than just the three critical edges ofpi-i, pt 
and pi+i (see Fig. 4), i.e. simple deletion of pi would not be sufficient anymore. 
This situation is solved by calculating a convex arc (note: a convex polygon is the 
shortest curve encircling a given finite set of planar points [2]) and by replacing 
point Pi by the sequence of vertices qi,...,qk on this convex arc between pi_i 
and pi+i iff the sequence of line segments pi-iqi , . . . , qkPi+i lies within the tube. 
Because the vertices are ordered we may use a fast linear-time convex hull routine 
in case of (O2). Barycentric coordinates with basis {pi-i,pi,pi+i} may be used 
to decide which of the intersection points is inside the triangle or not.^ In this case 
we continue with a triple of vertices starting with the calculated new vertex q^ ■ 
If (Oi) and (O2) do not lead to any change, the third option may lead to an 
improvement. 

(O3) Point Pi may be moved on its critical edge to obtain an optimum 
position Pnew minimizing the total length of both line segments pi-ipnew 
and PnewPi+i- That’s a move on a critical edge and an 0(1) solution is 
given below. Then subsequence (pi_i,pi,pi+i) is replaced in our curve by 
(pi_i,p„eu,,Pi+i). In this case we continue with vertices pnew,Pi+i,Pi+ 2 - 

We consider situation (O3), i.e. Pi lies on a critical edge, say e, and is not 
collinear with Pi_iPi^i- Let be the line containing the edge e. First, we find 
the point p^pf € le such that 

\Popt - p^-l I + \p^+l - Poptl = min T(p) 

pGie 



^ In the majority of such cases we found fc = 1, i.e. pi is replaced by qi. 
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with 



L{P) = (|p-p*-i| + |p*+i -pD- 



If Pgpj lies on the closed critical edge e we simply replace Pj by If it does 
not, we replace p^ by that vertex bounding e and lying closest to p^pt- 

We give a slightly simpler solution for finding the point p^p^ then provided 
in [1]. W.l.o.g. let us assume that le is parallel to the a:-axis: 



with ai = {ue - ViY + {ze ~ ZiY . 

Figure 5 illustrates a given simple cube curve by its critical edges only, the 
initial polygon and the correct MLP as calculated by our algorithm. Note that 
some of the vertices subdivide critical edges in a rational ratio specified by the 
heights of consecutive linear cube segments. 



Fig. 5. Initial polygon (dashed) and correctly calculated MLP. Critical edges are 
shown as short line segments. The rest of the tube is not shown 

We illustrate the performance of the algorithm with respect to length mea- 
surement on circles in We generate circles by 




leads to a quadric equation in Xopt^ 
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Fig. 6. Results of the length estimation of digitized circles with resolution 100^ 
to 1000^ using T = l{Vt) ■ 10"^ 



Here, Rx and Rz are the 3x3 rotation matrices about the x- and the 2;-axis, 
respectively. The angles (f>i , </>2 and ^3 are randomly chosen from a uniform dis- 
tribution in the interval [0, tt]. The radius r is randomly chosen from the interval 
[0.5, 1]. In each experiment a circle is generated and digitized on a 3D grid. The 
grid size varies between 100^ and 1000^. We performed experiments using differ- 
ent thresholds r, and Fig. 6 illustrates the case of r = l{Vt) ■ 10“”^ and resolutions 
(100 • n)^,n € {1, 2, . . . , 10}. For each combination we estimated the length of 
25 digitized circles. We normalized the results such that the true circumference 
of the underlying circles is tt. 

4 Analysis of Cube Curves 

In it’s current form the above algorithm is slower than necessary. It can be sped 
up by taking into account more of the available information about the possible 
positions of vertices of the MLP. Here we propose three ways of doing so: 1) Not 
all critical edges are relevant, i.e., not every critical edge can contain a vertex 
of the MLP. Removing irrelevant critical edges reduces the number of necessary 
computations. 2) In certain situations a critical edge can contain a vertex of the 
MLP only - if at all - at one of its endpoints. This information can be used 
for the initialization of the algorithm. 3) Consider a subsequence of the MLP 
which is contained in one fixed grid-layer. Assume the correct endpoints are 
known as well as the critical edges which contain the intermediate vertices of 
the MLP. Under these conditions the positions of the intermediate vertices can 
be calculated in closed form. 

Considerations as started in this section may lead to a better understanding 
whether it is possible to derive a non-iterative, i.e. closed form of an algorithm 
calculating 3D MLPs within simple cube curves. 
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Fig. 7. A subsequence of a simple cube curve lying entirely in one cube-layer 
(top). Its projection to a 2D face-curve. The vertices marked by dots indicate 
the corresponding critical edges 



4.1 Removing Irrelevant Critical Edges 

Consider a flat arc of a simple cube curve, i.e. an arc all of which cubes lie within 
one cube-layer. As shown in Fig. 7 this arc can be projected to a 2D plane, where 
the cube-curve becomes a face-curve. The 2D MLP algorithm reported in [7] may 
be used for such a planar segment of a simple cube curve. The following theorem 
also shows how in this case irrelevant critical edges can be identified. 

Theorem 2. Let E = {e^, e^+i, Ci+2, e^+s, 6^+4} be a sequenee of eonsecutive, 
parallel critical edges. Let ei,ei+2 and Ci+4 he coplanar. Consider the projection 
of the cube-curve segment containing E as in Fig. 7. Then e^+2 contains no 
vertex of the MLP if line segment eiei+4 intersects the boundary of the face- 
curve at Ci+2 only. 

Figure 8 illustrates the situation and no further proof is needed. 

4.2 Vertices on Endpoints of Critical Edges 

We again investigate flat arcs of the cube-curve. The following theorem shows 
that in a situation like the one shown in Fig. 9 the only possible positions of 
MLP-vertices are the endpoints of critical edges. 

Lemma 4. Let E = (e^, e^+i, . . . , ei+„_i) be a sequence of consecutive critical 
edges. Lf all these Cj ’s are pairwise parallel then they lie within the same grid 
layer (cube-layer). 

Proof. Moving into another grid-layer results in a critical edge in the plane 
between both grid layers, i.e. which is not parallel to the ej’s in the sequence E 
before. 
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Fig. 8. Illustration of Theorem 2 . Left: 63 contains a vertex of the MLP. Right: 63 
does not contain a vertex of the MLP and can be removed from the list of critical 
edges 




Fig. 9. Entering and leaving in same direction 



It holds that E = (e^, e^+i, . . . , Ci+n-i) is a maximum-length sequence of parallel 
critical edges iff all the e^’s contained in E are pairwise parallel and e^-i and 
are not parallel to (and thus not parallel to any edge contained in E). 

Theorem 3. Let E = (e^, e^+i, . . . , ei+„_i) he a maximum-length sequence of 
parallel critical edges. If Ci-i and ei+„ are both in the upper (lower) face-layer, 
the only possible vertex positions on the edges e^, . . . , Ci-n-i are the upper (lower) 
endpoints of these edges. 

Note, that by speaking about “upper” and “lower” face-layer, we assumed a 
horizontal layer. The same theorem holds if we replace “upper” and “lower” by 
“left” and “right” or by “front” and “back” . 

Proof. The shortest path connecting an edge-sequence E = (e^, e^+i, . . . , ei+„_i) 
lies within a horizontal plane, i.e., the positions of all vertices have the same 2:- 
coordinate (x-coordinate for “left” and “right”, y-coordinate for “front” and 
“back”). Since the left and right neighbors of E are both above (below) the 
cube-layer, it is optimal to move the vertices to the upper (lower) endpoints of 
the edges e^, . . . , Ci-n-i. 
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4.3 Closed Form Evaluation of Vertex Positions 

Let (ei,ei+i . . . be a sequence of consecutive parallel critical edges and 

E = (ci, Cj . . . , 6 k) the sequence which results from the former sequence by 
removing all the critical edges containing no vertex of the MLP. W.l.o.g. we 
assume that B lies within a horizontal cube-layer. 

Consider the sequence of the vertices (pa , Pa , Pej , ■ ■ ■ , Pe^ j Pb) being a subse- 
quence of the MLP. Here Pa and pb are assumed to be known, and Pe denotes 
the vertex lying on e. Define 

I = \PaPei \ + \PeiPej | + ■ • ■ + \PekPb\, 
and let {p)z be the z-coordinate of the vertex p. 

Theorem 4. For the specified situation it holds that the z-coordinates ofpsi,Pej , 
. . . , Pet, are given by linear interpolations 

\ \ I (Pb)z ~ {Pa)z 1^ I 

(Pei)z — [Pa)z + ^ |PoCi|, 

(Pej)z — [Peilz F ^ piCjl, ... 

Proof. We represent by the set of points in which lie on ep. 

Ci = {{xi,yi,z + Xi)\Xi e [0,1]}. 

We introduce a function cr acting on the indices such that 

^ (^(7(1)5 ^o-( 2 ) • • • 1 ^cr(m)) • 

Further we denote the positions of the vertices Pa and pb by 

Pa (^{7(0} 5 yf7(0) ; ) and Pb (^(7{m+l) 5 yf7(m-t-l) ) '^f7(m-t-l) ) ■ 

The length of the sequence {pa,Pei,Pej, ■ ■ ■ iPekiPb) is thus given by 



h^i.Xa{l) 5 ■ • • ; Xeri^rn) ) ^ ^ Y b^a(i+l) 1 )^ “k '^(7(2+!))^ 5 

where (|e,^(i)e,^(,+i) l)^ = - x„(i+i)f + {ya(i) ~ ya(i+i)f- Since L does not 

depend on the Xi and yi directly but merely on the horizontal distances between 
consecutive critical edges |eo.(qecr(i+i) | we can replace E by E' with edges 

e' = {(x',0 ,A.)|A.G [ 0 , 1 ]} 



such that 



^<7(i) ~ \^(T(i)^cr(i+l)\ 
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which leads to 



i-1 

Minimizing L with respect to Acr(i), • ■ )^a(m) leads to 

\ \ I 

'^cr(z) '^tr(O) ' / 



x',^ , i \ — a:' 



Z^(t' - t' \ 

_ j ^<r(0)h 

cr(m+l) ‘^cr(O) 



Converting to the previous notation yields the formulae in the theorem. 

Figure 10 shows a simple situation in which Theo. 4 can be applied. We finally 




Fig. 10. An example for the application of Theo. 4: The position of the middle 
MLP-vertex can be calculated using the theorem. In this case A = 1/3. An arc 
of the MLP is represented by the thin line. The bold edges of the cube curve are 
the critical edges 

present an example of a cube-curve with an MLP which can be calculated in 
closed form (see Fig. 11). The vertices in the left and right arcs, which lie in the 
Y-Z plane, can by Theo. 3 found to lie on the endpoints of critical edges. Which 
of the critical edges in these arcs are effective can be found by a 2D method for 
face-curves. The latter is also true for the two other arcs which lie in the X-Y 
plane. The locations of the vertices on the effective critical edges of these arcs 
can be found by Theo. 4. 

5 Conclusion 

In this paper we reviewed an algorithm for iterative approximation of 3D MLP 
of a simple cube curve and initiated a discussion whether it is possible or not 
to derive a closed-form, i.e. non-iterative algorithm for this computational prob- 
lem. We have shown that certain irrelevant critical edges can be identified, and 
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Fig. 11. The MLP of the shown cube-curve can be calculated in closed form 
using the presented methods. See text for details 



how the positions of some MLP-vertices can directly be calculated without an 
iterative procedure. These first steps towards a better understanding of simple 
cube curves are based on a segmentation of 3D cube curves into 2D face curves. 

Triplets of run-length sequences may play a crucial role in analyzing simple 
cube curves. Starting with one cube of a simple cube curve we may assign one 
sequence of numbers to any of the three coordinate axes defined by consecutive 
run length’ parallel to this coordinate axis (i.e. ‘how many consecutive cubes are 
in the same layer with respect to the coordinate axis?’). These sequences may be 
calculated during a first run through the given cube curve. For example, starting 
at the lower left cube of the simple cube curve shown in Fig. 1 on the left, and 
going to the right, we have sequences 

1,1,1,3,6,1,1,1,1,4,4+ 

(4-1- indicates that this run is in the same layer as the first cube) for the X- 
coordinate axis (left to right). 



4,4,2,7,1,1,5+ 

(again continuation of the same layer as the first 4 cubes) for the foreground-to- 
background P-axis, and 

5,2,2,2,10,1,1,1 

for the bottom-to-top Z-axis. This shows that the longest run of 10 cubes appears 
in one P-layer, matching the situation of Theorem 3, followed by a run of 9 cubes 
in one P-layer. Such a longest run might be a good start for a 3D MLP algorithm. 
An analysis of these run-length sequences is also expected to allow a more direct 
calculation of (possible) positions of MLP vertices on critical edges. Figure 12 
shows that these may be at irrational positions. 

The evaluation of the practical relevance of algorithmic proposals (as dis- 
cussed for the speed-up of the iterative algorithm) is particularly difficult, since 
the effect of these depends highly on the class of specific cube curves under con- 
sideration. For example, it is easily possible to construct curves with no irrelevant 
critical edges at one hand, and curves containing almost only irrelevant critical 
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Fig. 12. A cube curve with an MLP-vertex at an irrational position: As a con- 
sequence of Theo. 4 the vertex marked by the dot is at A = -\/2/(2 -|- v^) 



edges, on the other hand. Similarly, the applicability of Theorem 4 depends on 
the length of flat arcs in the cube-curve. 

The effect of methods such as presented here on the complexity of the dis- 
cussed iterative algorithm will be subject to future research, in close relation to 
methods modeling or generating specific classes of cube curves. For example, it 
will be worthwhile to investigate whether Theorem 3 allows the identification 
of at least two MLP- vertices for any given cube curve. This would allow a seg- 
mentation of a cube curve into two cube arcs, and the independent treatment of 
both arcs. 
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Abstract. We propose in this paper an arithmetical definition of 3-D 
discrete lines as well as an efficient construction algorithm. From this 
notion, an algorithm of 3-D discrete lines segmentation has been devel- 
oped. It is then used to calculate the length of a discrete curve. A proof 
of the multigrid convergence of length estimators is presented. 

Keywords: 3D discrete line. Segmentation, Length estimation. Discrete 
curve. 



1 Introduction 

A formal definition of a 3-D discrete straight line is mandatory in numerous 
applications using objects based on voxels [31,16,14,15,6]. For example, drawing 
3-D lines between two points of the discrete space is fundamental in discrete ray 
tracing. The existing methods may be sorted in two categories, a summary is 
given in [6,13]; 

— “by projections” methods, where two projections of the 3-D segment are 
independently computed on two basic planes [1] 

— “by direct calculus” methods obtained by extension of the 2-D and 3-D 
algorithms [16,5] 

Several authors have proposed definitions of 3-D 26-connected lines [17,29] and, 
as in 2D, it seems to be interesting to propose a simple arithmetical definition 
of 3-D discrete lines including the different structures obtained by the known 
drawing algorithms and suggesting a controlled thickness [7]. 

A more general arithmetical definition is given in [12] but the proposed defini- 
tion in this paper is sufficient for the presented work and allows to get a simple 
algorithm for segmentation and 3D discrete curves length estimation. In the sec- 
ond section of this paper, we give an arithmetical definition of 3-D lines as well 
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as a structure theorem allowing the control of the connectivity of 3-D discrete 
lines so defined. Moreover, a very simple scanning algorithm is proposed which 
is equivalent to those already known using “by projections” methods. It only 
relies on calculations on integers. In the third section, a recognition algorithm 
of naive 3-D lines segments is given ; it relies on the functionality property of 
a naive 3-D line and the recognition algorithm of 2-D naive lines segments [9]. 
Then, a linear segmentation algorithm of 3-D discrete curves is directly deduced 
from the recognition algorithm of 3-D lines segments. In the last section, an 
application of this algorithm to the calculation of 3-D discrete curves length 
is proposed as well as the comparison with other methods. The convergence of 
this length estimation technique is demonstrated and several examples are given. 



2 Definitions and Main Properties 

2.1 3-D Discrete Line 

To simplify the writing, we give a definition of a 3-D discrete line whose main 
vector is (a, b, c) G with a> b> c. The definitions for coefficients ordered in 
a different way are obtained by permuting x, y, z as well as the coefficients. We 
suppose in the following that a > b > c, a, condition to which it is easy to come 
back by symmetries. 

Definition 1. The 3-D discrete line, called T>, whose main vector is V (a, b, c), 
with (a, b, c) G , such that a > b > c is defined as the set of points {x, y, z) of 
7Z^ verifying the following diophantine inequalities: 



jp,<cx — az<p + e (1) 

\ p' < bx — ay < p' + e' (2) 

with p, p\ e, e! G 7Z . p and p' are called the lower bounds of T>, more- 
over, e and e! are named the arithmetical thickness of T>. Such a line is 
noted T>{a, b, c, p, p' , e, e') . 

Remarks: 

1. For a given main vector, there is a large number of different representations 
according to the lower bounds and thicknesses chosen (see Fig. 1. and Fig. 

3.). 

2. In the plane Oxz (resp. Oxy), the double inequality (1) (resp. (2)) represents 
a 2-D arithmetical line [26] with lower bound p (resp. p') and arithmetical 
thickness e (resp. e'), noted T> 2 d{c, a, p, e) (resp. T> 2 Dib,a, p' ,e')). 

3. The double diophantine inequalities (1) and (2) define two discrete planes of 

^3 ([ 2 , 8 , 7 ]). 
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Fig. 1. Representation in voxels of I?(10, 7, 3, 0, — 9, 13, 17), a 6-connected line 
where a point (a:, y, z) of V is represented by a unit cube centered at the integer 
point (x, y, z) 



Definition 2. We call 3-D naive lines, whose main vector is V{a,h,c), with 
(a, b, c) € , such that a > b > c, the 3-D discrete lines whose thickness verifies 

e = e' = a, they are named D{a, b, c, y, y!) . 

Remark: A 3-D naive line is projected into the plane Oxz and Oxy in two 2-D 
naive lines [26,9], noted I? 2 r>(c, a, y) and a, /r'). 

Theorem 1. A 3-D naive line D{a, b, c, y, y') with a > b > c is functional in x. 
Proof. Indeed, let us consider the naive line T>{a^b,c, y, y'), V is defined by 



(y<cx — az<y-\-a 
\y' <bx — ay < y’ -\- a 

which is equivalent to 




where [. . . ] denotes integer part of x. 



( 3 ) 

( 4 ) 



2.2 Relation between Arithmetical Thicknesses and Connectivity of 
3-D Discrete Lines 

In order to get a better understanding of the 3-D discrete line notion, here is the 
structure theorem which establishes a link between the arithmetical thicknesses 
and the connectivity of a 3-D discrete line. 

Theorem 2. Let us consider V{a, b, c, y, y' , e, e') with a > b > c, 

If e > a-\- c and e' > a -\-b, V is 6-connected (see Fig 1.). 

If e > a-\-c and a < e' < a-\-b or e! > a-\-b and a < e < a-\-c, D is 18-connected. 
Ifa<e<a-\-c and a < e' < a -\- b, V is 26-connected (see Fig 3.). 

If e < a or e' < a, V is disconnected. 
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Fig. 2. 



Proof. In order to demonstrate these results, we use the fact that projections 
of T> in the planes Oxy and Oxz are 2-D discrete lines whose connectivity is 
known [26]. 

For example, let us consider the third case ; a < e < a + c and a < e' < a + b, 
in the plane Oxz, we move from a pixel to the next one in the discrete line 
y<cx — az<y + e through a 4-connected or 8-connected move, the same in 
the plane Oxy for the line y' < bx — ay < y' + e' . All possible combinations 
between two pixels of planes Oxy and Oxz to get two voxels of have been 
studied on the figure 2. 

a. an 8-connected move in the plane Oxz and a 4-connected move in the plane 
Oxy lead to two voxels linked by an 1 8-connected move, 

b. a 4-connected move in the plane Oxz and an 8-connected move in the plane 
Oxy lead to two voxels linked by an 1 8-connected move, 

c. an 8-connected move in the plane Oxz and an 8-connected move in the plane 
Oxy lead to two voxels linked by a 26-connected move, 

d. a 4-connected move in the plane Oxz and a 4-connected move in the plane 
Oxy lead to two voxels linked by a 6-connected move. 

Therefore we show that the line V is 26-connected. 

The other results can be demonstrated in a similar way by watching all combi- 
nations of two voxels obtained from the possible ones of two pixels in the planes 
Oxz and Oxy. 

Remark: The naive lines are 26-connected and we go from a floor to the next 
one only through an edge or a vertex, we can see it on the figure 3. 

The next theorem states that there is a link between the 3-D GIQ (Grid In- 
tersect Quantization) [13] and the 3-D straight line defined above. We prove that 
this digitization process digitizes an Euclidean straight line into a 26-connected 
set of voxels such that 2 of its 3 canonical projections are 2-D discrete straight 
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Fig. 3. On the left iD(10, 7, 3, 0, 0), on the right I?(10, 7, 3, —5, 0) 



lines. Note that in a general case, Jonas et al. [13] show that none of the clas- 
sical digitizations into 26-connected curve has the t/iree-projection property {i.e 
projections of a 3-D chain onto each axis planes are identical to the digitization 
of the continuous curve projections). 

Theorem 3. The 3-D GIQ (Grid Intersect Quantization) of an Euclidean 
straight line is a 3-D discrete straight line defined above. 

Proof. Without loss of generality, we consider I?euc a 3-D Euclidean straight line 
whose main vector is (a, b, c) € T? with a > b > c, defined by: 



We consider the Euclidean planes Vi : z = and V 2 ■ y = The 

GIQ-digitization of these planes gives the following arithmetical naive planes: 



where V{u,v,w,y) : y < ux -\- vy -\- wz < y -\- max{\u\,\v\,\w\) [2,8]. In the 
following, we consider a free ambiguity GIQ process [13] (we assume a global 
strategy to remove pathological cases) . 

The intersection between this two discrete naive planes defines a 26-connected 
discrete line according to definition 1 and the structure theorem 2: 



Let ?; be a voxel of D, we consider GIQ{Deuc) the GIQ-digitization of Deuc- 
If V belongs to V, according to the GIQ process of the planes, we have {d denotes 
the classical Euclidean distance) : 





(6) 




d{v, Vi) < V 2 and d{v, V\) < d{v, Deuc) < 



( 8 ) 
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This is equivalent to: 

GIQiV^uc) (9) 

and thus 

V c GIQ{V,uc) (10) 

To prove the symmetric inclusion we consider v' G GIQ(T>euc), ii v' ^ T> we have 

two possibilities: 

— d{v',Vi) > or d{v',V 2 ) > that is equivalent with d{v' ,T>euc) > 
which leads to a contradiction with the GIQ process of T>euc 
~ d{v' ,T>euc) < but since the GIQ is free ambiguity, v' disconnects T> which 
leads to a contradiction with the structure theorem 2. 

Finally: 

V = GIQ{V,uc) (11) 

□ 



2.3 A Scanning Algorithm of 3-D Naive Discrete Lines 

Let us consider A{xa,IIa, za) and B{xb,Vb, zb) two points of ZZ^ . The al- 
gorithm given below draws a naive line segment between the two points A 
and B whose main vector is AB = {vx,Vy,Vz). According to the choice of 
the lower bound, different structures are obtained. In particular, for /r = /i' = 
_] max(\va:\,\vy\,\v^\) T^ _l_ generated voxels are the same as those defined by 

the algorithm of Bresenham extended to 3D [4,31]. 

The principle of the algorithm consists in computing in parallel in both planes 
Oxy and Oxz the pixels of the 2-D naive lines which are the projections of the 
3-D line in these two planes. So we recalculate the three coordinates (x, y, z) of 
the corresponding voxel. 



Algorithm for the construction of a naive discrete line segment with lower bounds y, and 
y between two points A(xa,Ya,za) and B(xB,yB,ze) of such that xb — xa > ys — Ya 
and Yb - Ya > Zb - za. 



Vx = xb - xa; Vy = Yb - ya; Vz = zb - za; 
n = Vz * xa - Vx * Za - y, 
t2 = Vy * xa - Vx * Ya - y'-, 
x = xa; y = Ya; z = za; 

While X < Xb repeat 

X = X -h 1; 
n = n -h Vz; 

t2 = t2 + Vy; 

If ri < 0 or ri > Vx then 
z = z -h 1; 
ri = n - Vx; 

Endif 

If t 2 < 0 or r 2 > Vx then 
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y = y + i; 

r2 = r2 - Vx; 

Endif 

Print the point {x,y,z); 

EndRepeat 



This algorithm is linear in the number of points to be computed and very simple. 



3 Algorithm for 3-D Naive Line Segment Recognition 

The recognition of 3 -D naive line segments is a direct outcome of the recognition 
of 2 -D naive line segments [ 9 , 7 ]. Indeed, a 3 -D naive line segment is bijectively 
projected into at least two orthogonal planes as two 2 -D naive discrete straight 
lines. Consequently, recognizing a 3 -D naive line segment consists in recogniz- 
ing both 2 -D naive line segments of its functional projections. Hence the linear 
algorithm proposed in [ 9 , 7 ] is particulary adapted to this problem of 3 D curve 
segmentation into naive lines, even more than other linear algorithms of 2 D curve 
segmentation like [ 11 , 25 ]. 



Algorithm for 3-D naive line segment recognition 

Input: 26-connected sequence of voxels to be analysed, named 5 
Output: True if 5 is 3-D naive line segment, False if not 

If the voxels of 5 may not be bijectively projected on at least two orthogonal planes 
in order to create two curves of pixels Ci and C2 then S is not a 3-D line segment, return 
False; Endif 

Apply the algorithm of 2-D naive line segments recognition on Ci and C2; 

If Cl and C2 are two 2-D naive line segments then 
5 is a 3-D naive line segment, return True; 

Else 5 is not a 3-D naive line segment, return False; 

Endif 



This algorithm is linear in number of points of the sequence to be analysed 
and all necessary calculations are exclusively done using integers. 

The characteristics of the recognized line segment are determined as a function 
of the characteristics of the 2 -D line segments obtained. For example, let us sup- 
pose that S is bijectively projected on the plane Oxy curves Ci and C2. The 
recognition algorithm of 2 -D naive line segments recognizes C\ as a segment of 
F’2 d(ci, oi, /ii) and C2 as a segment of T’2 d(^ 2, <22, At2)- Let us consider m the 
smallest common multiple of a\ and 02 such that m = k\ai = ^202 then S is 
a 3 -D naive line segment whose main vector is 1^(771,^262,^1^1) and with lower 
bounds and y.2- 
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4 Segmentation of 3D Discrete Curves 

The segmentation of a 26-connected discrete curve, named C, consists in “split- 
ting” C into maximal length segments of 3-D naive lines. To do so, we use the 
same principle as the algorithm presented in the previous section, but, to achieve 
a linear complexity, the search of functional projection planes is not executed 
in the same way. We proceed by increment ; three 2-D discrete curves are built 
and segmented along the recognition, they correspond to the projections of the 
curve part already scanned in the three coordinates planes. 

The algorithm given below uses the same bases as the segmentation algorithm 
of 2-D discrete curves presented in [9,7]. A sequence C of 26-connected voxels is 
given as input. 



Algorithm of segmentation of 26-connected discrete curves 
M = first voxel of C 

While C has not entirely been scanned repeat 

In each coordinate planes Oxy, Oxz, Oyz, init the characterictics of new segments 
Si, 52 and 53 ; 

SEGMENT=true; 

While the added voxel M possesses at least two projections in which the projected 
points of M can be added to the current 2-D segments [9,7] and SEGMENT 
repeat 

If the projections of the following voxel of C in the two coordinates planes used 
belong to the same octants as the current 2-D segments then 
M = following point of C; 

Add by symmetry the projected pixels of M, to the 2-D segments concerned; 

Else 

SEGMENT = false; 

Endif 

End repeat 

According to symmetries, compute the characteristics of the naive lines of the two 2-D 
segments obtained in the initial octants, then deduce the main vector of the recognized 
naive line segment; 

End repeat 



Remarks: 

1. The first point of a new segment is the last of the previous segment. 

2. An algorithm for 4-connected 2D line segment recognition is presented in [7] 
and it’s easy to use it (as presented before) for the segmentation of 18-connected 
discrete curves. 

Example: On the figure 4 we can see an example of the segmentation of a dis- 
crete curve, the starting and ending segment voxels are in black. The projection 
planes used are Oxy and Oyz, the segments resulting from the 2-D recognition 
are drawn in light gray on the projections in these two planes of the considered 
voxels set. Two naive line segments are obtained: 
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Fig. 5. The black voxels indicate the starting and ending segment. The 2-D 
segmented curves are in light gray 



Segment 1 which contains 13 points. The inequalities of the discrete naive lines 
of the 2-D segments obtained are —4 < —Ay — 5z < 1 and —2 < —2y — 5x < 3. 
Consequently, the 3-D naive line of the segment 1 has V (2, —5, 4) as main vector. 
Segment 2 length is 6. The inequalities of the discrete naive straigth lines of the 
2-D segments obtained are 0 < y — 2z < 2 and 0 < y — 2x < 2. Consequently, 
the 3-D naive line of the segment 2 has F(l,— 2,1) as main vector. 

Other examples of segmentation of 26-connected curves are given in the figure 5. 

5 Length Estimation Algorithm 

In 2D, a lot of approaches have been proposed for the length estimation of a 
discrete curve. These approaches can be sorted as follows: 

— Local approaches [23,10]: estimators are based on local or fixed size window 
characteristics of the discrete curve. An example of such estimators is based 



308 David Coeurjolly et al. 



on counting the number of odd and even codes in the Freeman chain code 
of the curve and returning the length estimation I = auodd + (Uneven where 
a and /3 are computed using statistical analysis {e.g (a,/3) = (1.183,1.059) 
in [10]). Such estimators exist for 3D curves [19,18] in a same approach : find 
the optimal weights associated to different chain codes. In the following, in 
spite of advantages of such approaches, we only deal with global algorithm. 

— Discrete Straight Segments (DSS) approaches [24]: these methods are defined 
by a curve partitioning into digital straight line segments of maximum length 
each. Hence, the length estimator is based on the obtained polyline length. 

— Minimum Length Polygon approaches (MLP) [27,28,21]: these methods com- 
pute a minimum length polygon that belongs to an open boundary of a digital 
object. 

— Euclidean Path approaches: in these approaches, a semi-continuous repre- 
sentation of the discrete curve is computed using Euclidean points located 
in a open square of size one centered on each discrete points. Since this Eu- 
clidean path provides a good approximation of the underlying real curve, 
a length estimator on this path can be defined either as the path polyline 
length [30] or embeded in an MLP process [3]. 

In the 3D case, Klette and Bulow [20] propose a 3D version of the MLP 
approach. The input of the algorithm is a closed cube-curve that is a strictly 6- 
connected closed curve, the minimum length polygons is computed in two steps 
(see figure 5): 

1. The vertices of the candidate polygon are first initialized to the inner vertex 
of edges of the cube-curve that are adjacent to three distinct cubes; such 
edges are called critical edges; 

2. local operations (move or delete) are computed on the polygon vertices to 
converge to the minimum length polygon. 

This first 3D global algorithm for the length estimation leads to several prob- 
lems: the first one is purely algorithmic since this algorithm is not linear in the 
number of points of the discrete curve. This non-linear complexity is due to 
the local optimization processes that introduce backtrack on the polygon vertex 
chain. Since a vertex of a candidate ML-polygon can be moved along a critical 
edge, the calculated vertices of the polygon might not belong to the discrete grid 





Fig. 6. An example of a cube-curve with its ML-polygon 
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and we have to introduce an t-MLP that is the minimum length polygon up to a 
threshold r {i.e. \lr-MLp{C) — Imlp{C)\ < r where C denotes the cube-curve). 
Furthermore, there is no multigrid convergence proof of this MLP 3D version. 
In the following, we propose a length estimator based on the 3D naive discrete 
straight line segmentation. The interesting points of such an algorithm are: 

— a purely discrete and optimal in time algorithm; 

— multigrid convergence of the length estimator whatever the curve is closed 
or not and whatever the dimension; 

~ since the segmentation process holds whatever thickness has the curve, we 
have a general length estimator whatever the topology of the curve. 

We first describe the algorithm based on the Naive Straigth line Segmenta- 
tion (NSS): 



NSS length estimation algorithm 

Input: 26-connected sequence of voxels to be analysed, named S 
Output: the estimated length of S 

Compute the segmentation using the above algorithm of S 

Let V ={S'i}o...n denotes the polyline returned by the segmentation. The vertices of this 
polyline are the starting and the ending points of the naive line segments. 

Return (where l{Si) denotes the Euclidean length of the segment Si) 



For the sake of clarity, this algorithm is designed as a post-processing of the 
segmentation but it can easily be embedded in the segmentation process since 
it is purely greedy. 

In the next section, we present a proof of the multigrid convergence of a slight 
variant of NSS. 

5.1 Proof of the Multigrid Convergence 

In [24,22], Kovalevsky and Klette prove the multigrid convergence of the 2D DSS 
approach. The main result of this proof is that the error between the boundary 
length, denoted by Be, of an Euclidean convex object and the estimated length 
using a DSS segmentation of the digitized boundary, denoted by Bqs on a grid 
of size S, is linear with <5: 



\l{Bc) - Idss{Bcs)\ < 0{6) (12) 

In the following, we propose a proof of this asymptotic convergence under 
the curvature hypothesis without the convexity one, and that remains true in 
any dimension. In the sequel we assume the following: 

1. r : [0, 1] — > denotes the underlying continuous curve whose curvature is 

bounded by C = 4. 
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2. r is included in the ^-enlargements of tubes Ti,. . . ,Tn of diameter S and 
lengths L{Ti),. . . ,L{Tpf). The ^-enlargement of a tube is the union of this 
tube and two parts as described in figure 7 (for negative values of S, the 
notion of enlargement is similarly defined) . 

3. Ti starts at T(0), within KiS error. 

4. Tjv ends at T(l), within K 2 S error. 

5. Ti’s end is T^+i’s start. 

6. L{Ti) > for any i < N. 

We approximate the length of T by the length sum of the T^’s. This tube 
approach is a general framework for discrete polygonalization schemes. 

The begin and end of a tube are the intersection of the axis with the lateral 
faces (extremities). The pertinence of the last hypothesis is discussed later in the 
case of particular discretizations. Moreover, we assume that 5 is small enough 
to avoid pathological cases such as half-turns in a tube (| < R). The goal is a 
bound on the error, linear in S. Notice that the hypothesis (2) implies that F 
gets out of the (— 5)-enlargements of the R’s through lateral faces. 

We first give an overview of the proof: Lemma 1 shows that the angle of F 
within a tube is small enough to achieve a linear error in <5 m a tube (Corollary 1). 
In Lemma 2, we show an upper bound on the angle between two adjacent tubes 
that leads to an upper bound on the error while joining two tubes. Under the 
sixth hypothesis, we have a sufficiently small number of tubes to achieve a linear 
bound on the summed error of joinings (Corollary 2). Corollaries 1, 2 lead to a 
global linear bound on the error. 

Lemma 1. The angle between F^ and the axis of R in the (—6) -enlargement 
of Ti is lower than B = arcsin(-j^) -|- L(Ti)/R. If a such that tana = \J 2 R - 5 
verifies i?sin(a) < LiTf), then the angle is bounded by B = a. 

^ This angle is the maximal one between a tangent of F and the tube axis. 



6/2 T 6/2 




Fig. 7. Left: ^-enlargement of tube T. 
is the tube length, the dashed line is 
(— 5)-enlargement of the tube 



L-6 




Right: schema for proof of lemma 1: L 
the tube contour, the solid line is the 
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Proof. See figure 7 : this is the case of largest negative derivative for a projection 
on a plane including the direction of the tube. The following holds: 

n<L/R (13) 

sin0 < (14) 

This yields the first result. The particular case in which i?sin(a) < L{Ti) for 
tan a < 2 R-s illustrated in figure 8; in this case, L is large, and the derivative 
must be small enough to keep P in the enlargement of the tube. This proves that 
the angle is bounded by arctan( □ 

This bound is 0{^/ SC) under the above assumptions. Moreover, this proves 
that the derivative of T in a parametrization colinear to the axis of the tube 
is bounded by y^l + (d — l)p^ with p = sin{B). This bounds the difference 

between the length of P in Ti and the length of Ti by + {d— l)p^ — 1): 

Corollary 1. The difference between the length ofTi and the length of P in Ti 
is hounded by L{Ti){^Jl + {d — l)p^ — 1). This is linear in LiTf) and SC. 

This is linear in <5, but does not complete the proof: we have a linear error 
in SC for segmentations, but ’’joining” the segments leads to small errors (see 
figure 8): some parts could be taken into account zero or two times. These errors 
have to be bounded too. For this we will need bounds on the derivative of 
parametrizations of P ’’later” than in the (— (J)-enlargements; in the following 
lemma we work on (5 enlargements, with similar results. 

Lemma 2. If the angle between P and the axis is lower than 0 in the —S- 
enlargement, then it is lower than 0' = arcsin(sin(0) + SC) in the S- enlargement. 
Moreover, the distance to the axis is lower than Rcos{0) — Rcos{0') if P gets 
out of the —S enlargement at the lateral faces (extremities). 

Proof. See Fig. 9. This is the 2D projection of the worst case. One can easily 
derive the followings: 

Rcos{0') + A = Rcos{0) (15) 

Rsin{0') = Rsin{0) + S (16) 




Fig. 8. Left: Errors out of segments. Right: Case L large in Lemma 1 
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This yields the result. □ 

This implies that F has variations of angle within O' . The angle fl between Ti 
and Ti+i is bounded by 29' . Thus the length lost or counted two times be- 
tween Ti and is bounded by the F length between getting out of Ti and 
getting in which is bounded by ^ , with fl angle between Ti 

and Ti+i, with p = sin{9'). 

This leads to the following important corollary: 

Corollary 2. The error committed in joining two tubes is 0(6 fl) = 0(6^^'^ VC). 
As the number of such errors is bounded by , the total error resulting of 

joins is 0{6C). 

Hence, using corollary 1, we obtain the expected result. The error is linear 
under the above hypothesis. Constants have not been explicited as they are 
far from optimal by the proof below. More refined results would be useful. All 
hypothesizes are verified except the last one for usual segmentations, with <5 
linear in the resolution. We now have to study the hypothesis according to which 
L(Ti) > K^i/SjC, for any i < N. 

This hypothesis is particularly true whenever the discretization and polygo- 
nalization verify that any curve which lies in a tube of radius er with the reso- 
lution r is included in a segment (this can easily be proved by considering the 
fact that for bounding curvature C, the minimal length of F before er-deviation 
from the tangent is This will be refered in the sequel as an e-tolerant 

algorithm. Experiments below have been made with a straightforward algorithm 
which doesn’t have the algorithmic properties of the initial algorithm. 

Figure 9 illustrates the fact that with 8-connectivity and with segments 
only 8-connected, large errors can be made. One can have similar bad behaviour 
with 4-connectivity; as explained above, a solution to avoid such bad results is 
adding a tolerance in the algorithm (this is simply an increase of the width pa- 
rameters of bands assimilated to segments). As such pathological cases can be 
considered as negligible, perhaps the initial NSS algorithm would have a similar 
behavior in practical cases. Notice that such counter-examples do not mislead 
MLP. We present experiments illustrating proved results; Fig. 10 shows linear 
convergence for a circle (2D, convex, convergence proved in [24,22]), and Fig. 12 
shows linear convergence of the e-tolerant variant for an helix (3D, with bounded 
curvature, proof above). Then Fig. 13 shows that in practical 3D-cases, linear 
convergence can be achieved with an ellipse even without e-tolerance; however, 
convergence is much less regular than with e-tolerance. 

Note that, in the proof using convexity (see [24,22]) can be extended 
to a finite number of inflexions (or finite number of areas with null curvature), 
with bounded curvature. This is possible without the hypothesis above about 
the length of segments. Convexity avoids particular cases as in Fig. 9 (right). 

5.2 Experiments 

In our experimentations, we compute the NSS length of 3D mathematical curves 
(circles, ellipses and helixes) on which the theoretical length is known, using the 
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5-erdargement 

(-5)-en]argement 





Fig. 9. Left: Figure for lemma 2. Right: r{t) = {t,r'^cos{> pt/r)). The curvature 
is bounded by 1 for any value of the resolution r. The length computed (the 
length of the dashed curve) by a discretization algorithm in this case does not 
converge to the length of the curve for small values of r 



above algorithm. Hence we consider a digitization of those Euclidean objects, 
using the GIQ process, on grids of increasing resolution in order to check the 
asymptotic convergence. In Fig. 10, we present the multigrid error graph and the 
execution time graph in 2D for a circle, and in Fig. 12, we present the multigrid 
error graph with an e-tolerant algorithm for an helix in 3D. In Fig. 13, we show 
that in the particular case of a 3D ellipse, the NSS length algorithm converges. 
The interest of such a test is to check if it is possible, in concrete cases, to achieve 
a linear decrease of the error without e-tolerance and thus use the algorithmic 
advantages of the NSS approach. The error is defined as the percentage of the 
absolute value of the difference between the true length and estimated length, to 
the true length. First, these results show an experimental asymptotic convergence 
since the error decreases with the resolution. If we compute the error inverse, we 
verify the linear nature of the convergence. Since the starting point of a closed 
curve in the segmentation algorithm may lead to different polygonalizations, we 
have plot for the circles curve the 95% confidence intervals (vertical bars) when 
we compute all possible segmentations. Hence, we can see that this starting 
point problem does not affect the results. The execution graph shows the linear 
complexity of the segmentation process in the number of points of the discrete 
curve. 

In [21], Klette et al. have already compaired the DSS and the MLP ap- 
proaches in the 2D case and the comparisons both in execution time, speed of 
convergence and accuracy show that the DSS approach gives better results. In 
the 3D case, since the 3D MLP algorithm suffers from several problems, and 
since the 3D DSS does not, theoretically and practically, differ to the 2D one, 
we are convinced that the comparison will lead to the same results. 
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3D-NSS length estimator convergence 



3D NSS execution time 




Fig. 10. These graphs present the error of the estimator on both 3D circles and 
the execution time in ms on a Sun Ultra SparcWorkstation 




Fig. 11. An example of a NS-segmentation of a circle and an ellipse 



Improved algorithm on a helix Improved algorithm on a helix 





Fig. 12. Left: multigrid convergence of the e-tolerant algorithm in the case of an 
helix in Right: illustration of the error decrease 
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3D NSS length estimator convergence 



3D NSS length estimator convergence 




Fig. 13. Left: multigrid convergence of the 3D-NSS on a 3D ellipse of parameter 
(1, v^)- Right: illustration of the linear decrease of the error 



6 Conclusion 

In this article, we have first presented an arithmetical definition of 3D discrete 
lines and an extension to the third dimension of the discrete naive straight 
line recognition algorithm. Moreover we have shown a segmentation algorithm 
which is purely discrete and optimal in time (0{n) in time and 0(1) in space). 
Based on this algorithm, we then proposed a 3D discrete curve length estimator 
with the same complexity. In the last section, we have presented a proof of the 
multigrid convergence of length estimators based on a general tube framework in 
any dimension. We have shown that the discrete naive straight line segmentation 
verifies the proof hypothesis under a slight refinement and therefore the length 
estimator based on this algorithm also converges. The error bound we found 
is the same as in the literature concerning the DSS approaches but with less 
hypothesis on the underlying Euclidean curve. We suggest that this general 
framework could be used in the case of the 3D MLP approach: bounding the 
number of critical edges as in hypothesis (6) would be sufficient. 
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Abstract. Jordan, Peano and others introduced digitizations of sets in 
the plane and in the 3D space for the purpose of feature measurements. 
Features measured for digitized sets, such as perimeter, contents etc., 
should converge (for increasing grid resolution) towards the correspond- 
ing features of the given sets before digitization. This type of multigrid 
convergence is one option for performance evaluation of feature measure- 
ment in image analysis with respect to correctness. 

The paper reviews work in multigrid convergence in the context of digi- 
tal image analysis. In 2D, problems of area estimations and lower-order 
moment estimations do have ’’classical” solutions (Gauss, Dirichlet, Lan- 
dau et ah). Estimates of moments of arbitrary order are converging with 
speed k ( v ) = The linearity of convergence is known for three 

techniques for curve length estimation based on regular grids and polyg- 
onal approximations. Piecewise Lagrange interpolants of sampled curves 
allow faster convergence speed. A first algorithmic solution for conver- 
gent length estimation for digital curves in 3D has been suggested quite 
recently. In 3D, for problems of volume estimations and lower-order mo- 
ment estimations solutions have been known for about one-hundred years 
(Minkowski, Scherrer et ah). But the problem of multigrid surface con- 
tents measurement is still a challenge, and there is recent progress in this 
field. 



1 Introduction 

Geometric image analysis approaches are normally motivated by concepts in 
Euclidean geometry. A common strategy is: approximate picture subsets in 2D 
by polygons or in 3D by polyhedrons and use Euclidean geometry from that 
moment on for any further object analysis or manipulation step. A theoretical 
motivation is given by the fact that rectifiable curves and measurable surfaces 
can be approximated by polygonal curves or polyhedral surfaces up to any de- 
sired accuracy. This means that if we consider grid resolution as a potentially 
improvable parameter, then polygonal or polyhedral approximations appear to 
converge (for a set-theoretic metric) to the original preimage of the given ob- 
ject. The important question arises: does a convergence toward the true value 
also hold for calculated properties? For example, if we measure the length of a 
digital curve then the calculated value should converge to the correct length of 
a preimage in Euclidean space digitized with increasing grid resolution. 



G. Bertrand et al. (Eds.): Digital and Image Geometry, LNCS 2243, pp. 318—338, 2001. 
(c) Springer-Verlag Berlin Heidelberg 2001 
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In ancient mathematics, Archimedes and Liu Hui [39] estimated the length 
£(7) of a circular curve 7. Liu Hui used regular n-gon approximations, with 
n = 3,6, 12,24,48,96, ..., see left of Fig. 1. In case of n = 6 it follows 3 ■ d < 
£(7) < 3.46 • d for diameter d, and for n = 96 it follows that 



The used method is mathematically correct because the perimeters of inner and 
outer regular n-gons converge towards the circle’s perimeter for n 00 . For 
example, for the inner 3 x 2”-gons, having perimeters 



The function K(n) defines the speed of convergence, which is linear in this case. 

The perimeters of digitized circles have been calculated in image analysis 
using sometimes graph-theoretical concepts such as the length of a 4-curve, or of 
an 8-curve where diagonal steps are weighted by v^, see, for example, [13,27]. 
Such graph-theoretical concepts of path-length measurements are not related 
to digitized Euclidean geometry. Grid-intersection digitizations of line segments 
having a slope of 45° (for 4-paths) or of 22.5° (for 8-paths) provide simple ex- 
amples for illustrating this. Convergence of digital curves toward a preimage 
with respect to the Hausdorff-Chebyshev-distance does not imply convergence 
of length calculated for these digital curves, toward the true length, but a proper 
preprocessing step (e.g. polygonal approximation of digital curves) may ensure 
such a desirable property as will be shown below. 

We recall another historic example [35] cited in [18]. Assume that the lateral 
face L of a straight circular cylinder of radius p and of height h is cut by {k — 1) 
planes, k > 2, which are parallel to the base circles and which segment the 
cylinder into k congruent parts. Furthermore assume a regular n-gon, n > 3, in 



223/71 < 7T < 220/70 , i.e. tt « 3.14 . 




it follows 



K(n) = \pn — 27rr| « 2'Krjn , for n > 6 . 




3 e 12 24 <te 90 132 364 708 1930 



Fig. 1. Inner and outer hexagon approximating a circle (left), and percentage 
errors for perimeter estimation using inner n-gons 
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Fig. 2. Triangulation of the lateral face of a straight circular cylinder [35] 

every cross section including both base circles, see Fig. 2 for fc = 4 and n = 6. 
The axis of the cylinder and any vertex of such an n-gon defines a halfplane, 
which bisects an edge of the n-gon in the neighboring cross section or base circle. 
Now we connect for two neighboring n-gons each edge in one n-gon with those 
vertex of the other n-gon closest to this edge. This results into a triangulation 
(i-e. a specific polyhedrization) of the lateral face L of the cylinder into 2kn 
congruent triangles having a surface area equal to 

.4(Tt„) = ^ . 

If k and n go to infinity then the length of the edges of the triangular faces of 
Tk,n converges to zero. However, the surface area of Tk^n does not necessarily 
converge towards the surface area -4(L) = 2Trph of the lateral face! This is only 
true if k and n go to infinity such that fc/n^ converges to zero. If k/n^ converges 
to g > 0 then converges to 

It may even happen that goes to infinity, e.g. k = n^, and then it follows 
that A{Tk,n) goes to infinity as well! Note that this example is based on sampling 
of surface points which cannot be assumed in image analysis. Digitization of sets 
provides an even less accurate input for subsequent steps of feature measurement. 

The paper specifies the concept of multigrid convergence and reviews related 
results for measurements of moments, the length of curves in 2D or 3D, and 
surface area. 

2 Multigrid Convergence 

First we recall three digitization models frequently used in image analysis: Gauss 
digitization, grid-intersection digitization (for 2D only), and inner or outer Jor- 
dan digitization. 
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Let r > 0 be a real number called grid resolution. The dilation of a set 
S C IR" by factor r is defined to be 

r-S = {{r ■ xi,...,r ■ Xn) : (xi, . . . , x„) e , 

for n > 1. Following [17], this is a dilation with respect to the origin (0, . . . , 0), 
and other points in the Euclidean space 3R" could be chosen to be the fixpoint 
as well. 

In studies on multigrid convergence sometimes it may be more appropriate 
to consider sets of the form r ■ S (the approach preferred, e.g., by Jordan and 
Minkowski) digitized in the orthogonal grid with unit grid length, instead of 
sets S digitized in r-grids with grid length 1/r. The study of r — > oo corresponds 
to the increase in grid resolution, and this may be either a study of repeatedly 
dilated sets r • S' in the grid with unit grid length, or of a given set S in repeat- 
edly refined grids. This is a general duality principle for multigrid studies [25]. 
We choose the repeatedly refined grid approach for this paper which is of com- 
mon use in numerics. An r-grid point gl ^ = {iijr, . . . ^infr) is defined by 

integers ii, . . . ,i„. 

Definition 1. For a set S C n>l, its Gauss digitization Gr{S) is defined 
to he the set of all r-grid points contained in S . When r = 1 the Gauss digitization 
is denoted by G{S). 

For example, consider and all r-grid points as centers of isothetic squares 
with edge length 1/r. Then the set Gr(S) is defined to be the union of all those 
squares having their center points in Gr{S). 

If the given set is a curve 7 in the plane then the grid-intersection model [7,12] 
is of common use in digital geometry. Of course, this scheme can be adapted 
to r-grid points for any value of r > 0, and the resulting sequence of r-grid 
points is the grid intersection digitization A (7), which can be characterized by 
a start point and a chain code (i.e. a sequence of directional codes). A digital 
straight line Ir{l) is an 8-curve of r-grid points resulting from the grid-intersec- 
tion digitization of the straight line 7 in the Euclidean plane, excluding the 
straight lines y = x -\- i/2, where i is an integer. A digital straight line segment 
(DSS) is a finite 8-connected subsequence of a digital straight line. 

The important problem of volume estimation was studied in [17] based on 
gridding techniques. Any grid point (i, j, k) in the Euclidean space is assumed 
to be the center point of a cube with faces parallel to the coordinate planes and 
with edges of length 1. The boundary is part of this cube (i.e. it is a closed 
set). Let S' be a set contained in the union of finitely many such cubes. Dilate 
the set S with respect to an arbitrary point p G in the ratio r : 1. This 
transforms S into Sfl. Let lf{S) be the number of cubes completely contained 
in the interior of Sf., and let uh^{S) be the number of cubes having a non-empty 
intersection with S^. In [17] it is shown that r~^ ■ 1^{S) and r~^ ■ uP{S) always 
converge to limit values L{S) and U{S), respectively, for r 00, independently 
of the chosen point p. Jordan called L{S) the inner volume and U{S) the outer 



322 



Reinliard Klette 



volume of set S, or the volume V(S') of S if L{S) = U{S). The volume definition 
based on gridding techniques was further studied, e.g., in [29,34]. 

The following definition is about an n-dimensional situation. For n = 3, 
a regular Euclidean cell complex consists of (topologically closed) r-cubes, r- 
squares, r-edges and r- vertices (see [20] for a review on cell complexes), and gen- 
eralizations to higher dimensions, as well as a restriction to the two-dimensional 
case are straightforward. 

Definition 2. For a set S C 5ft", n > 1, its Jordan digitizations JE{S) 
and are defined as follows: the set Jf{S) (also called the inner digi- 

tizationj contains all n-dimensional cells completely contained in the interior 
of set S, and the set Jf~{S) (also called the outer digitization^ contains all n- 
dimensional cells having a non-empty intersection with set S. 

The unions of all cells contained in Jf{S) or Jf~{S) are isothetic polyhedra 
J“(5') or J+(5), respectively. The Hausdorff-Chebyshev distance, generated by 
the doo metric, between the polyhedral boundaries 03 f (S) and dJf (S) is greater 
than or equal to 1/r for any non-empty closed set S, and it holds that 

J;(5) CSC3+{S) 

in this case. 

Gauss and Jordan digitizations have been used in gridding studies in math- 
ematics (geometry of numbers, number theory, analysis). The model of grid- 
intersection digitization has been introduced for computer images, and it may 
be applied to planar curves. 

A general scheme for comparing results obtained for picture subsets with 
the true quantities defined by the corresponding operation on the preimage in 
Euclidean space has been formalized in [36]. The following definition [18] specifies 
a measure for the speed of convergence toward the true quantity. 

Definition 3. Let F be a family of sets S in 5ft", and digr{S) a digital image of 
set S, defined by a digitization mapping digr- Assume that a quantitative property 
V , such as area, perimeter, or a moment, is defined for all sets in family F. An 
estimator E-p is multigrid convergent for this family F and this digitization 
model digr iff there is a grid resolution rs > 0 for any set S G F such that the 
estimator value Ep{digr{S)) is defined for any grid resolution r > rs, and 

\Ep{dtgr{S)) - r{S)\<K{r) 

for a function n defined for real numbers, having positive real values only, and 
converging toward 0 if r oo. The function k specifies the convergence speed. 

Gauss and Dirichlet knew already that the number of grid points inside a 
planar convex curve 7 estimates the area of the set bounded by the curve within 
an order of 0{£{'y)), where £(7) is the length of 7. 

Theorem 1. (Gauss/Dirichlet ca. 1820) For the family of planar convex sets, 
the number of r-grid points contained in a set approximates the true area with 
at least linear convergence speed, i.e. «(r) = r~^ . 
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Today we know that the convergence speed of this estimator is actually at 
least [15] for planar, bounded, 3-smooth (i.e. continuous 3rd deriva- 

tives with positive curvature at all boundary points except a finite number of 
arc endpoints) convex sets, and it cannot be better than which is a trivial 

lower bound. 

Theorem 2. (Huxley 1990) For the family of planar, bounded, 3-smooth convex 
sets, the number of r-grid points contained in a set approximates the true area 
with a convergence speed of n(r) = r“, for —1.5 < a < —1.3636. 

Closing the gap is an open problem which is a famous subject in number the- 
ory [28], and is closely related to digital geometry [24]. 

3 Moments and Moment-Based Features in 2D 

In this section we cite worst-case error bounds from [25] in estimating real mo- 
ments (and related features) of sets S C from corresponding discrete mo- 
ments. Note that in case of 3-smooth sets the claimed positive curvature ex- 
cludes straight boundary segments. Throughout this section we assume that S 
is a planar convex set whose boundary consists of a finite number of arcs, 
also allowing straight line segments if not otherwise stated. The {p, q) -moments 
of set S are defined by 

Mp,q{S) = JJ dx dy , 

s 

for integers p, q > 0. The moment M.p^q{S) has the order p-\-q. In image analysis, 
the exact values of moments M.p^q{S) remain unknown. They are estimated by 
discrete moments pp^q{S) where 

i^pAS)= 

(ij)6G(S) 

which can be calculated from the corresponding digitized set G{S) of set S. 
The grid resolution r has to be used as scaling factor if the approach involves 
repeatedly refined grids. The moment-concept has been introduced into image 
analysis in [14]. 

The contents or area -4(5') of a planar set 5, i.e. the moment A4o,o(5) of 
order zero, is estimated by the number of grid points in G(5), i.e. by the discrete 
moment po.o(5). For the center of gravity of a set 5, 

/ ■Mi,o(5) Mop{S) \ 

\Mo.o{S)^ Mo.oiS)) 



f Mi,o( 5) /rp,i(5) \ 
\Mo,o( 5)’ po,o(5)y 



the estimate 
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is calculated from its digital set G{S). The orientation of a set S can be described 
by its axis of the least second moment. That is the line for which the integral of 
the squares of the distances to points of S' is a minimum. That integral is 







s 



r'^{x,y,(p,p)dxdy , 



where r(x, y, (p, p) is the perpendicular distance from the point (x, y) to the line 
given in the form 

X ■ cos ip — y ■ sin p = p . 

We are looking for the value of p for which /(S, p, p) takes its minimum, and by 
this angle we define the orientation of the set S. This (/3-value will be denoted 
by T>(S), i.e. 



min /(S, (/3, p) = I{S,'D{S), p), for some value of p . 

v,p 

Again, this feature is estimated by replacing integration and set S by a dis- 
crete addition and a digital set G(S), respectively. With respect to applications 
note that this feature requires sets with ”a main orientation”, i.e. A^ 2 ,o(S) yf 
Ado, 2 ( 5 ). Finally, we also mention the elongation of S (see [16,42]) in direction 
p which is the ratio of maximum and minimum values of /(S', (/?, p), i.e. 

max/(S, (p, p) 

£{S) = ^ . 

mm/(S,(p,p) 

p, p 

It may be estimated by digital approximations of the /-function values as in case 
of the orientation of set S. 

The curvature of the boundary of the considered set plays an important role. 
It makes an essential difference whether at least one straight section on the 
boundary is allowed or not. 

Theorem 3. (Klette/Zunic 2000) Let S a convex set whose boundary consists 
of a finite number of arcs, then Aip^q{S) can be estimated by ■ 

Pp.qif ■ S) within an error of 0{r~^), and this error term is the best possible. 

However, if S is 3-smooth and convex, i.e. the boundary does not possess any 
straight segment, then the application of Huxley’s theorem leads to a reduced 
upper error bound. 

Theorem 4. (Klette/Zunic 2000) Let a planar bounded 3-smooth convex set S 
be given. Then Aip^q{S) can be estimated by • Pp^q{r ■ S) within an 

error of O (jlogr)^) ■ r~^^ « O . 

The following theorem specifies how progress in the estimation of the “basic 
difference” |Ado,o(^ ■ S) — r'^ ■ A(5')| by 0(K(r)) can be used to improve error 
bounds for higher-order estimates |Adp,g(5') — • Pp^q{r ■ S')], for a set S 

being n-smooth, for some integer n = 3,4, . . . , including n = 00 . For a function 
«(r) > 0, for r > 0, a family /).(r) of classes C of planar sets is defined such that 
the the following conditions are satisfied: 
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(i) C is nonempty; 

(ii) if S' € C then it satisfies 

lAio,o(?' • S) - • A{S)\ = O (K(r)) ; 

(iii) if a set S belongs to C then any isometric transformation of S belongs to C 
as well; 

(iv) any set which can be represented by a finite number of unions, intersections 
and set-differences of sets from C also belongs to C. 

This definition allows a formulation of the following theorem which ’trans- 
lates’ possible future progress in number theory into a formulation of related 
error bounds for moments of arbitrary order. Let Fq be the smallest family 
of sets which contains all n-smooth planar convex bounded sets, and which is 
closed with respect to finite numbers of intersections, unions and set-theoretical 
differences. 

Theorem 5. (Klette/Zunic 2000) Let S be a planar n-smooth convex set and 
let nir) he such that Fq is contained in the family It follows that Mp^q(S) 

can be estimated by ■ p,p^q{r ■ S) within an error of O [K{r) ■ 

Let 5” be a set in a class in Fq. The given theorems allow a derivation of the 
following upper error bounds for feature estimations [25]. An upper error bound 
for area estimates ^ /tq.o (?' ■ S) is directly given by Huxley's theorem, i.e. 

■ /^o,o (?' • 5)1 = O . 



The same upper error bound holds for the estimates 



1 ^ Mi,o {r ■ S) 
r /ro,o {r ■ S) 



and 



1 ^ fiQj (r ■ S) 
r ^ 0,0 {r ■ S) 



of the coordinates 

mi.o (5) 77io,i (5) 

mo,o(5) mo,o(5) 

of the center of gravity, respectively. For the estimate of the orientation only 
sets S with M 2 ,o{S) yf Alo.2(5) are relevant. Then S's orientation 'D(S) can be 
recovered within an worst-case error of 0(r“TT+®), by using the estimate 



tan(2 •T>(5)) 



2 ■ 1^1, i{r ■ S) 

fi2,o(^-'5)-Mo.2(»'-'S') ■ 



The elongation 5 of a 3-smooth convex set S can be estimated by 



_ ti(r ■ S) + ^Jt 2 {r ■ S) 
ti{r ■ S) - \/t 2 {r ■ S) ’ 
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for 

ti{r ■ S) = Jl2,o{r ■ S) + JIo 2ir ■ S) 

and 

t2(r ■ S)=4- • S)f + (7l2,o(?' ' S) - ’ >5'))^ , 

for a planar set S. The error in the approximation £{S) « 0{r ■ S) has an 
upper error bound in O 

Of course, Theorems 3, 4, and 5 may also be used to derive error bounds for 
features defined by moments of higher order than just up to order two. 



4 Length of Curves in 2D and 3D 

There are provable convergent length estimators in 2D and 3D, where linear 
multigrid convergence has been shown for planar convex curves. A superlinear 
convergence of asymptotic length estimation has been achieved in [10] 

just for the case of digitized straight lines, and there are superlinear length 
estimators in 2D if sampling of curves is assumed instead of digitization. 

4.1 Polygonal Approximations of Curves in 2D 

Boundaries of digitized planar sets, or digitized planar curves, can be approxi- 
mated by a polygon, using a digital straight segment (DSS) procedure to segment 
the boundary into a sequence of maximal-length DSSs. The resulting polygon 
depends on the starting point and the orientation of the scan. Besides this DSS- 
based approach to the approximation of digital curves by polygons, there are 
other possible approaches using minimum-length polygons; see [5,6,38]. We re- 
view all three methods with respect to multigrid convergence. 

Given a connected region S in the Euclidean plane and a grid resolution r, 
the r-frontier dGr{S) of S is uniquely determined. Note that an r-frontier may 
consists of several non-connected curves even in the case of a bounded convex 
set S. A set S is r-eompaet iff there is a number rg > 0 such that dGr{S) is 
just one (connected) curve, for any r > tq. 

Theorem 6. (Kovalevsky/Fuchs 1992, Klette/Zunic 2000) Let S be a convex, r- 
compact polygonal set in 3?^. Then there exists a grid resolution tq such that for 
all r > ro, any DSS approximation of the r-frontier dGr{S) is a connected 
polygon with perimeter pr satisfying the inequality 

\C{d{S))-pr\<^-^[eoss{r) + ^^ . 

This theorem and its proof can be found in [25]. The proof is based to a large 
extent on material given in [26]. The value of tq depends on the given set, and 
^Dssif) > 0 is an algorithm-dependent approximation threshold specifying the 
maximum Hausdorff-Chebyshev distance (generalizing the Euclidean distance 
between points to a distance between sets of points) between the r-frontier 
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dGr{S) and the constructed (not uniquely specified !) DSS approximation poly- 
gon. Assuming £_DSs(f) = 1/r, it follows that the upper error bound for DSS 
approximations is characterized by^ 

27t 27t 4.5 ^ , 

— -I- ^ « — if r ^ 1 (i.e. r is large) . 

r‘^ r■^/2 r 

The grid resolution 1/r is assumed in the chord property in [33], where a DSS is 
defined to be a finite 8-path. In the case of using cell complexes it is appropriate 
to consider a finite 4-path as a DSS iff its main diagonal width is less than -\/2, 
see [1,32]. 

A second approach, see [38], is based on Jordan digitization of sets S in the 
Euclidean plane. The difference set O^(S') \Ir(>S') can be transformed into a sub- 
set such that the Haiisdorff-Chebyshev distance (generated by the doo metric) 
between its inner and outer boundary is exactly 1/r, i.e. the grid constant. The 
perimeter of S can be estimated by the length of a minimum-length polygon 
(MLP) contained in this subset, and circumscribing the inner boundary of this 
subset, which is homeomorphic to an annulus. The subset can be described by a 
sequence of r-squares, where any r-square has exactly two edge-neighboring r- 
squares in the sequence. Such a sequence is called a one- dimensional grid con- 
tinuum (ID-GC). Such ID-GCs are treated in the theory of 2D cell complexes 
in the plane. This specifies an alternative approach (GG-MLP in short) to the 
approximation of digital curves; it has been experimentally compared with the 
DSS method in [23]. 

For the case of GG-MLP approximations there are several convergence the- 
orems in [38], showing that the perimeter of the GG-MLP approximation is a 
convergent estimator of the perimeter of a bounded, convex, smooth or polyg- 
onal set in the Euclidean plane. The following theorem is basically a quotation 
from [38]; it specifies the asymptotic constant for GG-MLP perimeter estimates. 



Theorem 7. (Sloboda/Zatko/Stoer 1998) Let ^ be a (closed) convex curve in 
the Euclidean plane which is contained in a ID-GC of r-squares, for r > 1. Then 
the GC-MLP approximation of this ID-GG is a connected polygonal curve with 
length A satisfying the inequality 

g 

Ir "E: di(y) Ir E — ■ 
r 

Finally we sketch a third method, see [5,6], which is also based on minimum- 
length polygon calculation. Assume an r-frontier of S which can be represented 
in the form P = {vo,vi, . . . ,Vn-i) where vertices are clockwise ordered and 
the interior of S lies to the right. For each vertex of P we define forward and 
backward shifts: The forward shift f{vi) of vt is the point on the edge (vi,Vi+i) 
at the distance 6 from Vi. The backward shift b(yi) is that on the edge (vi-i,Vi) 
at the distance <5 from Vi. 

^ Let ft(r) = 2-k jr'^ 2 -k jr ■ \/2. Then it follows that ft(r) ^ 7r-\/2 as r — > oo. 
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In the approximation scheme as detailed below we replace an edge (vi,Vi+i) 
by a line segment (vi^ f{vi^i)) interconnecting Vi and the forward shift of ni+i, 
which is referred to as the forward approximating segment and denoted hy L f(vi). 
The backward approximating segment (vi,b(vi-i)) is defined similarly and de- 
noted by Lb{vi). Now we have three sets of edges, original edges of the r-frontier, 
forward and backward approximating segments. Let 0 < <5 < 0.5/r. Based on 
these edges we define a connected region Af.{S), which is homeomorphic to the 
annulus, as follows: 

Given a polygonal circuit P describing an r-frontier in clockwise orientation, 
by reversing P we obtain a polygonal circuit Q in counterclockwise order. In the 
initialization step of our approximation procedure we consider P and Q as the 
external and internal bounding polygons of a polygon Pb homeomorphic to the 
annulus. It follows that this initial polygon Pb has area contents zero, and as a 
set of points it coincides with dGr{S). 

Now we ‘move’ the external polygon P ‘away’ from Gr(5'), and the internal 
polygon Q ‘into’ Gr{S) as specified below. This process will expand Pb step 
by step into a final polygon which contains dGr{S), and where the Hausdorff- 
Chebyshev distance between P and Q becomes non-zero. For this purpose, we 
add forward and backward approximating segments to P and Q in order to 
increase the area contents of the polygon Pb- 

To be precise, for any forward or backward approximating segment Lf(vi) 
or Lh(vi) we first remove the part lying in the interior of the current polygon Pb 
and updating the polygon Pb by adding the remaining part of the segment 
as a new boundary edge. The direction of the edge is determined so that the 
interior of Pb lies to the right of it. The resulting polygon P^ is referred to as 
the approximating sausage of the r-frontier and denoted by A^{S). The width 
of such an approximating sausage depends on the value of S. An AS-MLP curve 
for approximating the boundary of S is defined as being a shortest closed curve 
7^(5') lying entirely in the interior of the approximating sausage A^(S'), and 
encircling the internal boundary of Af (5'). It follows that such an AS-MLP curve 
7^(5) is uniquely defined, and that it is a polygonal curve defined by finitely 
many straight segments. Note that this curve depends upon the choice of the 
approximation constant S. 

Theorem 8. (Asano/Kawamura/Klette/Obokkata2000) Let S be a bounded, r- 
compact convex polygonal set. Then, there exists a grid resolution rg such that 
for all r > rQ it holds that any AS-MLP approximation of the r-frontier dGr{S), 
with 0 < 5 < .5/r, is a connected polygon with a perimeter 1^ and 

\C{dS)-lr\ < (4V2-k 8 * 0.0234)/r= 5.844/r. (1) 

These three Theorems 6, 7 and 8 show that the DSS error bound of 4. 5/r is 
smaller than the AS-MLP bound 5.844/r, and the AS-MLP is smaller than the 
GC-MLP bound 8/r . Further theoretical and experimental measures may be 
used for performance comparisons such as effectiveness defined by the product 
of error and number of generated line segments, or the time efficiency of imple- 
mented algorithms. With respect to asymptotic time complexity, a linear-time 
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algorithm is known for any of these three linear approximation (i.e. polygonal- 
ization) methods [6,23]. 

4.2 Higher-Order Approximation for Sampled 2D Curves 

Higher-order approximations of curves with the purpose of length estimations 
have been studied in [30,31]: for fc > 1, estimate the length £( 7 ) of a regular 
parametric curve 7 : [ 0 , 1 ] ^ JR” from (m -|- l)-tuples Qm = ■ ■ ■ , Qm) of 

points Qi = ^{ti) positioned on the curve 7 . The parameters U's are not assumed 
to be given. Of course, sampling (see both examples in the Introduction) is a 
different situation compared to digitization. An increase in grid resolution r 
defining a scale for two dimensions in the plane, corresponds to an increase in 
the number m of sampling points defining a one-dimensional scale on the curve. 

Some assumptions about the distribution of the ti's are needed to make the 
sampling problem solvable [30]. The problem is the easiest when the £’s are 
chosen in a perfectly uniform manner, namely = ^. In such a case it seems 
natural to estimate 7 by a curve 7 that is piecewise polynomial of degree a > 1 . 
Then we prove 

Theorem 9. (Noakes/Kozera/Klette 2001) Let 7 he and let ti’s be sam- 
pled perfectly uniformly. Then £( 7 ) = £( 7 ) + 0( .+ao ); where sq is 1 or 2 
according as s is odd or even. 

It is known [31] that Lagrange estimates of length based on a uniform grid do 
not always converge to £( 7 ) when the unknown tfs are non-uniform. In [30] it 
is shown that there are some approximately uniform samplings of tfs for which 
those estimates are well-behaved. More precisely 

Definition 4. For £ > 0, the ti’s are (e , k) -uniformly sampled if there is a 
reparameterization 4> : [ 0 , 1] ^ [ 0 , 1], for k > 1, such that ti = + 0 ( ^i+e ). 

Lagrange estimates of length can behave badly for (0, fc)-uniform sampling, 
see [31], but for 0 < £ < 1 the following theorem holds [30], using piecewise 
Lagrange interpolants 7 . 

Theorem 10. (Noakes/Kozera/Klette 2001) Let the ti’s be sampled {e,k)- 
uniformly where 0 < £ < 1 and k > A. Then, for piecewise- quadratic Lagrange 
interpolants 7 , determined by a sampled -tuple Qm o.nd based on a uniform 

grid, £( 7 ) = £( 7 ) -k 0{:^). 

Whereas Theorem 9 permits length estimates of arbitrary accuracy (for s 
sufficiently large). Theorem 10 refers only to piecewise-quadratic estimates (i.e. 
s = 2), and accuracy is limited accordingly. However, even in the latter case it 
holds that the quartic convergence speed^ is three magnitudes faster then the 
linear convergence speed discussed for DSS, GC-MLP and AS-MLP polygonal- 
izations (with s = 1). These sampling-based results encourage further research 
on higher-order approximations for digitized curves. 



^ Note that m specifies an increase in the order of ydn only with respect to a scale 
for two dimensions in the plane, i.e. the quartic convergence speed in m may be 
compared with a quadratic convergence speed in r. 
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4.3 A Polygonal Approximation of Curves in 3D 

Consider the length estimation problem for rectifiable curves 7 in the three- 
dimensional Euclidean space. We assume curves 7 which lead to simple cube 
curves for the digitization model 

A cube-curve is a sequence g = (/o, cq, / i, ci, c„) of r-faces fi and r- 

cubes Ci in for 0 < i < n, such that r-faces fi and /i+i are sides of r-cube Ci, 
for 0 < i < n and fn+i = /o- Such a cube-curve is simple iff n > 4, and for any 
two r-cubes Ci, Ck in g with |i — fc| > 2 (mod n-|-l) it holds that if CiDc/c yf 0 then 
either \i — k\ = 2 (mod n -|- 1) and ct fl Cfc is an r-edge, or |* — fc| = 3 (mod n -I- 1) 
and Ci n Cfc is an r-vertex. A tube g is the union of all r-cubes contained in a 
cube-curve g. Such a tube is a polyhedrally-bounded compact set in 3?^, and it 
is homeomorphic with a torus in case of a simple cube-curve. 

A curve is complete in g iff it has a non-empty intersection with any r-cube 
contained in g. 

Definition 5. A minimum-length polygon (MLP) of a simple cube-curve g is 
a shortest polygonal simple curve a which is contained and complete in tube g. 

Following [38], the length of a simple cube-curve g is defined to be the length 
C(a) of an MLP a of g. Theorem 7 states that this length estimation approach 
is multigrid convergent to the true value in case of planar convex curves 7 as 
specified in this theorem. 

An algorithm for approximating such an MLP in a simple cube-curve has 
been specified in [8]. It is based on the following theorem [21]. An edge contained 
in a tube g is critical iff this edge is the intersection of three cubes contained in 
the cube-curve g. 

Theorem 11. (Buelow/Klette 2000) Let g be a simple cube-curve. Critical ed- 
ges are the only possible locations of vertices of a shortest polygonal simple curve 
contained and complete in tube g. 

The algorithmic solution in [8] provides a polygonal approximation of desired 
MLP’s, and thus a length measurement method for simple cube-curves in 3D 
space. The algorithm possesses a measured time complexity in 0(n). However, 
two open problems remain at this stage: the time complexity might be provable 
always in 0(n), and the convergence might be provable always towards the MLP. 
For details of the algorithm see [8] . 

5 Surface Area of Regular Solids 

A ‘3D object’ can be modeled by a regular solid, which is defined to be a simply- 
connected compact set having a measurable surface area [18]. Algorithms for 
multigrid-convergent surface area estimation are still a research topic. Obviously, 
increasing the grid resolution in a digitization of a regular solid in the form of 
a 3D cell complex [4], and measuring the area of the resulting isothetic surface, 
does not result in convergence to the true value. This might be compared with 
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Fig. 3. Three Euclidean sets digitized for increasing grid resolution and approx- 
imated by marching-cube polyhedrizations 



the fact that 4-path length is not related to the length of a digitized curve in 
2D. Marching-cube based polyhedrizations, see Fig. 3 do not support multigrid- 
convergent surface area estimations toward the true value [19]. This might be 
compared with the fact that 8-path length (with weighting factor -\/2 for diagonal 
steps) is not related to the length of a digitized curve in 2D. 

Polyhedrization is a common goal of segmenting the surface of a digitized 
regular solid, normally given in the form of boundary points of a 3D grid point 
set (e.g. using 3D Gauss digitization) or in the form of a two-dimensional grid 
continuum (2D-GC) defined by a difference between the inner and outer Jordan 
digitizations. 

5.1 Experimentally Measured Convergence 

Expanding the ideas of DSS approximations into 3D leads to a digital plane seg- 
ment (DPS) approach for achieving multigrid-convergent measurement of sur- 
face area: the boundary of a Gauss- or Jordan-digitized set is segmented into 
maximum-size DPSs, and surface areas of related polyhedral faces are added to 
form a final estimate. 

[32] introduced arithmetic geometry which allows characterizations of hy- 
perplanes in n-dimensional spaces. [2] proposed a general definition that linked 
planes and topology, introducing |a] -I- \b\ [c] thick planes. These planes were 

further specified and used in [11]. For a generalization to n dimensions see [3]. 
Digital plane segments can be defined within arithmetic geometry as follows: r- 
cubes have eight directed diagonals. The main diagonal of a Euclidean plane 
is those directed diagonal (out of these eight) that has the largest dot product 
(inner product) with the normal of the plane. Note that in general there may be 
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two possible diagonals 



Fig. 4. Illustration of the main diagonal of a DPS 



more than one main diagonal for a Euclidean plane; if so, we can choose any of 
them as the main diagonal. The distance between two parallel Euclidean planes 
in the main diagonal direction is called the main diagonal distance between these 
two planes. 

Now consider a finite set of faces of r-cubes in 3D space. A Euclidean plane 
is called a supporting plane of this set if it is incident with at least three non- 
collinear vertices of the set of faces, and all the faces of the set are in only one 
of the (closed) halfspaces defined by the plane. Note that any non-empty finite 
set of faces has at least one supporting plane. Any supporting plane defines a 
tangential plane, which is the nearest parallel plane to the supporting plane such 
that all faces of the given set are within the closed slice defined by the supporting 
and tangential planes. Note that a tangential plane may be a supporting plane 
as well. Figure 4 gives a rough sketch of such a set of faces, where n denotes the 
normal to the two parallel planes, and v is the main diagonal. 

Definition 6. A finite, edge- connected set of faces in 3D space is a digital pla- 
nar segment (DPS) iff it has a supporting plane such that the main diagonal 
distance between this plane and its corresponding tangential plane is less than 
V^/r (i.e. the length of a diagonal of an r-cube). 

Such a supporting plane is called effective for the given set of grid faces. Let 
V be a vector in a main diagonal direction with a length of V^/r, let n be the 
normal vector to a pair of parallel planes, and let d = n • po be the equation of 
one of these planes. According to our definition of a DPS, all the vertices p of 
the faces of a DPS must satisfy the following inequality: 

0<np — d<nv 

Let n = (a, b, c). Then this inequality becomes 

0 < ax by cz — d < |a| -I- |&| -I- |c| , 

i.e. an DPS is an edge-connected subset of faces in a standard plane [11]. A 
simply-connected DPS is such that the union of its faces is topologically equiv- 
alent (in Euclidean space) to the unit disk. 
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The general DPS recognition problem can be stated as follows: Given n 
vertices {pi, P2, • ■ • , Pn}, does there exist a DPS such that each vertex satisfies 
the inequality system 



0<n-pi — d<n-v, i = 

[40] suggests a method of turning this into a linear inequality system, by elimi- 
nating the unknown d as follows: 

n-p, -n-pj < n- V, = 

This system of n? inequalities can be solved in various ways. [11,40] use a Fourier 
elimination algorithm. However, this algorithm is not time-efficient even for very 
small cell complexes. In fact, in [40] a more advanced elimination technique than 
Fourier-Motzkin was proposed to eliminate unknowns from systems of inequali- 
ties. This technique eliminates all variables at once, whereas the Fourier-Motzkin 
technique eliminates one variable at a time. Eliminating all variables at once 
leads to an O(n^) algorithm for recognizing a DPS, which is faster than the 
algorithm sketched in [11]. [40] included results for hyperplanes of arbitrary di- 
mension. Note that two different definitions are actually used to define digital 
planes, depending on what kind of connectivity relation is required: 

0 < ax + by + cz — d < |a| -I- |6| -I- |c| 
or 

0 < ax + by + cz — d < max{\a\^ |6|, |c|) . 

The second definition was used in [40], but the results obtained there for the 
elimination technique are equally valid for the first definition. 

An incremental algorithm for DPS recognition has been proposed in [22], 
based on updating lists of effective supporting planes. This algorithm can be 
used for segmenting boundaries of digitized 3D sets into maximal-size DPSs, see 
Fig. 5 for two examples. 

Actually, any DPS recognition algorithm could be used for segmenting a 
surface of a 3D cell complex into maximal-size DPSs. However, the starting 
point and the search strategy during the process of ‘growing’ a DPS are critical 
for the behavior of such an algorithm. Also, after obtaining maximal-size DPSs, 
it is not straightforward to derive a polyhedron from the resulting segmentation 
of the surface. 

Analytical surface area calculation of an ellipsoid, with all three semi-axes 
a,b,c allowed to be different, is a complicated task. If two semi-axes coincide, 
i.e. in the case of an ellipsoid of revolution, the surface area can be analytically 
specified in terms of standard functions. The surface area formula in the general 
case is based on standard elliptic integrals. Example 2 in [18], reporting recent 
work by G.Tee, specifies an analytical method of computing the surface area 
of a general ellipsoid. This area can be used in experimental studies as ground 
truth to evaluate the performance of DPS algorithms in surface area estimation. 
Figure 6 shows the error in the estimated value relative to the true value for an 
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Fig. 5. Agglomeration of faces of a sphere and an ellipsoid into DPSs 



Relative Error 




Fig. 6. Relative errors in surface area estimation for an ellipsoid in three orien- 
tations for increasing grid resolution. Figure 5 illustrates resolution r = 40 



ellipsoid in three different orientations, using a search depth of 10 in region grow- 
ing (breadth-first search). In general these DPS-based estimates behave ‘better’ 
than those based on convex hull (for digitized convex sets) or on marching-cube 
algorithms. Convex hull and marching-cube methods lead to relative errors of 
3.22% and 10.80% for r = 100, respectively, while the DPS error is less than 
0.8%. The DPS method shows a good tendency to converge, but theoretical 
work needs to be done to prove this. Altogether, there are working algorithms 
which appear to provide multigrid-convergent surface area estimations, but there 
is no related theorem stating this property for some type of 3D sets. 
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5.2 Multigrid Convergence of Estimated Surface Area 

Recently there is actually progress on proving multigrid-convergent behavior 
of surface area estimation, but so far without an algorithmic solution for the 
proposed method! [9,37] introduce the relative eonvex hull CHg(P) of a polyhe- 
dral solid P which is completely contained in the interior of another polyhedral 
solid Q. If the convex hull CH(P) is contained in Q then CHq(P) = CH(P); 
otherwise CHg(P) C Q is a ‘shrunk version’ of the convex hull. To be precise, 
let pq be the (real) straight line segment from point p to point q in 3?^, and 
introduce the following definition[37]: 

Definition 7. A set A C Q C 3?^ is Q-convex iff for all p, q € A sueh that 
pq C Q we have pq C A. Let P ‘A Q. The relative convex hull CHq{P) of P 
with respeet to Q is the intersection of all Q-eonvex sets containing P . 

For a set 5” C 3?^ we defined the inner and outer Jordan digitizations 3f{S) 
and J^(/S') for grid resolution r > 1. If 5" is a regular solid with a defined surface 
area, let A(5') be its surface area in the Minkowski sense [29]. 

Theorem 12. (Sloboda/Zatko 2000) Let S C be a compact set hounded by 
a smooth closed Jordan surface iDS. Then 

lim^s(cH,^^S)i^riS)))=A{S) . 

This theorem, from [37], specifies a method of multigrid convergence which still 
requires research on algorithmic implementation, theoretical and experimental 
convergence speed, and performance evaluation in comparison with other meth- 
ods such as the DPS segmentation method sketched above. 

6 Conclusions 

Euclidean geometry specifies the ground truth, the correct moment, length or 
surface area prior to digitization. The concept of multigrid convergence may pro- 
vide a general methodology for evaluating and comparing different approaches. 
The measurement of quantitative properties is certainly a main topic in digital 
geometry. [27] is one of the early publications in this area, and [23] is one of 
the more recent ones, both focusing on length estimates. Probability-theoretical 
aspects of digitization errors [41,43] have only been studied for a few elementary 
figures and simple geometric problems; further studies should provide answers 
to open problems such as those listed in [25]. 

There is still no solution with respect to multigrid convergence for surface 
area estimation which combines a convergence theorem and an algorithmic im- 
plementation. The study of non-polygonal approximations of digitized curves 
with respect to improvements in convergence speed appears as another impor- 
tant open problem. 

For all multigrid-convergence problems, it is important to determine what 
optimum convergence speed n{r) is actually possible (for example, see open prob- 
lem defined by Theorem 2). A test set of six curves has been specified in [23] 
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for evaluations of curve length estimations, and general ellipsoids are proposed 
in [18] for surface area performance evaluations. Evaluation measures might be, 
e.g., absolute error, efficiency (error times number of generated segments), and 
computing time. A classification of properties C,A,Aip^q,V,S,T>,... with re- 
spect to families of sets, optimum convergence speed, and optimum algorithmic 
time complexity might be a long-term project. 
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Abstract. This paper* looks at the problem of approximating the length 
of the unknown parametric curve 7 : [0, 1] ^ IR" from points qt = "fiti), 
where the parameters ti are not given. When the U are uniformly dis- 
tributed Lagrange interpolation by piecewise polynomials provides effi- 
cient length estimates, but in other cases this method can behave very 
badly [15]. In the present paper we apply this simple algorithm when 
the ti are sampled in what we call an e-uniform fashion, where 0 < e < 1 . 
Convergence of length estimates using Lagrange interpolants is not as 
rapid as for uniform sampling, but better than for some of the examples 
of [15]. As a side-issue we also consider the task of approximating 7 up 
to parameterization, and numerical experiments are carried out to inves- 
tigate sharpness of our theoretical results. The results may be of interest 
in computer vision, computer graphics, approximation and complexity 
theory, digital and computational geometry, and digital image analysis. 



1 Introduction 

Recent research in digital and computational geometry and image analysis con- 
cerns estimation of lengths of digitized curves; indeed the analysis of digitized 
curves in IR^ or IR^ is one of the most intensively studied subjects in image data 
analysis. This paper contributes to this topic by showing that there are possible 
improvements in convergence speed compared to all known methods in digital 
geometry, however, based on sampling of curves (as common in approximation 
theory) compared to digitization (as common in digital geometry). 

A digitized curve is the result of a process (such as contour tracing or 2D 
thinning extraction) which maps a curve 7 (such as the boundary of a region) 
onto a computer-representable curve. An analytical description of 7 : [0, 1] — > IR^ 

* This research was performed at the University of Western Australia, while the third 
author was visiting under the UWA Gledden Visiting Fellowship scheme. Addi- 
tional support was received under an Australian Research Council Small Grant^ and 
under an Alexander von Humboldt Research Fellowship.^*’ 
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is not given, and numerical measurements of points on 7 are corrupted by a 
process of digitization: 7 is digitized within an orthogonal grid of points ( ^ , ^ ) , 
where i,j are permitted to range over integer values, and m is a fixed positive 
integer called the grid resolution. 

Depending on the digitization model [9] , 7 is mapped onto a digital curve and 
approximated by a polygonal curve 7^ whose length is an estimator for d{'j). 
This is a standard approach for approximating a digital curve with respect to 
geometric analysis tasks. However, different smooth approximations (e.g. snake 
model) are used in image analysis as well, where the convergence analysis with 
respect to geometric figures is omitted. Approximating polygons 7™ based on 
local configurations of digital curves do not ensure multigrid length convergence, 
but global approximation techniques yield linearly convergent estimates, namely 
d(7) — d(7m) = 0{:^) [1], [11], [12] or [20]. Recently, experimentally based results 
reported in [4] and [10] confirm a similar rate of convergence for 7 C In the 
special case of discrete straight line segments in IR^ a stronger result is proved, 
for example, [6], where 0{ 5 ) order of asymptotic length estimates are given. 

In Theorems 1 and 2 presented in this paper the convergence is of order at least 
0{ ) and 0{:^) when 0 < £ < 1, respectively. 

For fc > 1, consider the problem of estimating the length ^(7) of a regular 
parametric curve 7 : [0,1] ^ IR" from m+ 1-tuples Q = {qo,qi, . . . ,qm) of 
points qi = "/{ti) on the curve 7. The parameters ti are not assumed to be given, 
but some assumptions are needed to make our problem solvable. For example, if 
none of the ti lie in (0, the task becomes intractable. The problem is easiest 
when the U are chosen in a perfectly uniform manner, namely ti = -^ (e-g- see 
also [14] or [21]). In such a case it seems natural to estimate 7 by a curve 7 that 
is piecewise polynomial of degree r > 1. We prove first in this paper: 

Theorem 1. Let 7 he and let ti be sampled perfectly uniformly. Then 

there exists piecewise-r- degree polynomial 7, determined by Q such that dif)) = 
d(7) -I- 0{ ^r+p ); where p is 1 or 2 according as r is odd or even. 

As usual, 0{g{m)) means a quantity whose absolute value is bounded by some 
multiple of g{m) as m 00 . We are principally concerned with non-uniform 
sampling. More precisely 

Definition 1. For 0 < £ < 1, the tfs are said to be £-uniformly sampled 
when there is an order-preserving reparameterization (f> : [0,1] — *■ [0,1] 
such that ti = + 0 (;t^). 

Note that £-uniform sampling arises from two types of perturbations of uni- 
form sampling: first via a diffeomorphism </> : [0,1] — > [0,1] combined subse- 
quently with added extra distortion term 0{ ). In particular, for if> = id and 

£ = 0 (£ = 1) the perturbation is linear i.e. of uniform sampling order (quadratic), 
which constitutes asymptotically a big (small) distortion of a uniform partition 
of [0, 1]. The extension of Definition 1 for £ > 1 could also be considered. This 
case represents, however, a very small perturbation of uniform sampling (up to 
a 0-shift) which seems to be of less interest in applications. 
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Lagrange estimates of length can behave badly for 0-uniform sampling (the 
more elaborate algorithm of [15] is needed for this case), but for 0 < e < 1 
we prove the following, using piecewise-quadratic Lagrange interpolants Q* * (see 
Section 4). 

Theorem 2. Let the ti he sampled e-uniformly, where 0 < e < 1, and suppose 
that fc > 4. Then, there is a funetion** 7 , determined by Q, sueh that ^( 7 ) = 
dil) + 0{^). 

Whereas Theorem 1 permits length estimates of arbitrary accuracy (for r suffi- 
ciently large). Theorem 2 refers only to piecewise-quadratic estimates, and ac- 
curacy is limited accordingly. The interest in this (the main result of the present 
paper) lies in the non-uniform distribution of the unknown parameters ti’s. The 
proofs of Theorems 1 and 2 also permit uniform estimates of 7 up to repa- 
rameterization. Note that the construction of the piecewise-r-degree polynomial 
interpolant including Q* (see Sections 3 and 4) requires neither the explicit 
knowledge of 7 nor of the parameters ti (each P-! is constructed over a uniform 
local grid in s S [0,rj; for Q* over uniform grid in [0,2]). The latter are used 
merely to compare ^( 7 ) with d(^) and 7 with 7 , respectively. More specifically, in 
order to prove Theorems 1 and 2 both global and loeal t- and s-parameterizations 
shall be used. On the other hand, the explicit construction of the interpolant PJ 
(or Q*) approximating 7 (and thus d('-f)) resorts exclusively to the local param- 
eterization. 

For these results Q arises from uniform or e-uniform samplings as opposed to 
digitization. So strict comparisons cannot be made. Our results seem relevant to 
digital and image geometry nonetheless for the following reasons. They provide 
comparisons with the interpolation and indicate potential problems which might 
arise in digitization based on non-uniform distribution of ti. Moreover, they 
show that using piecewise Lagrange polynomial approach to estimate length 
of a digiztized curve 7 may not always be appropriate. Finally, as a special 
case we provide upper bounds for optimal rates of convergence when piecewise 
polynomials are applied to the digitized curves. Related work can also be found 
in [2,3,7,8,17,19]. There is also some interesting work on complexity [5,18,22,23]. 

2 Sampling and Curves 

We are going to discuss different ways of forming ordered samples 0 = tp < 
ti < ^2 < ■ • ■ < tm = 1 of variable size m -I- 1 from the interval*** [0, 1]. The 
simplest procedure is uniform sampling, where U = :^ (where 0 < i < m). 
Uniform sampling is not invariant with respect to reparameterizations, namely 
order-preserving diffeomorphisms (j) : [0,1] — > [0,1], where fc > 1. A small 
perturbation of uniform sampling is no longer uniform, but may approach unifor- 
mity in some asymptotic sense, at least after some suitable reparameterization. 

** See section 4 for details. 

* * * In the present context there is no real gain in generality from considering other 

intervals [0,T]. 
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The e-uniform sampling in Definition 1 of the previous Section is a possible ex- 
ample of such perturbation. Note that (j) and the asymptotic constants are chosen 
independently of m > 1, and that e-uniform implies J-uniform for 0 < (5 < e. 
Uniform sampling is e-uniform for any 0 < e < 1 . At the other extreme are 
examples, where sampling increments U — ti-\ are neither large nor small, con- 
sidering m, and yet sampling is not e-uniform for any 0 < e < 1: 



Example 1 . Set U to be or according as i is even or odd. Then ( 1 / 2 to) < 
ti — U_i < ( 3 / 2 to) for all 1 < f < m and all m > 1 . Thus sampling is 0 - 
uniform. To see that sampling is not e-uniform for e > 0 assume the opposite. 
Then, for some reparameterization (j) : [ 0 , 1 ] ^ [ 0 , 1 ], U-i-i ~ = 2m “ 

</> (^) - (;^) +0(;^) and U+2 -U+i = ^ = <l> (^) - i"-^)+ 0 {^). 

By the Mean Value Theorem 



1 

2m 









_ 3 _ 

2m 



■n4T') 

m 



o{- 



m 



l + £ 



), 



( 1 ) 



for some and G Fixing i and increasing m, 

^ '/''(O) and </>'(C2r^) ^ </>'( 0 )- On the other hand, by (1), ^ 

1/2 and — > 3 / 2 : a contradiction. 



Let II • II be the Euclidean norm in IR", where n > 1 , with < •, • > the cor- 
responding inner product. The length d{'y) of a parametric curve {k > 1 ) 
7 : [0,1] — > M" is defined as d{j) = /q^ ||7(t)||dt, where j{t) G M" is the 
derivative of 7 at f G [ 0 , 1 ]. The curve 7 is said to be regular when 7(f) 0 , 

for all t G [ 0 , 1 ]. A reparameterization of 7 is a parametric curve of the form 
'yoip : [0, 1] ^ K", where '0 : [0, 1] — > [0, 1] is an order-preserving diffeomor- 
phism. Clearly, 70^/ has the same image and length as 7. Let 7 be regular: then 
so is any reparameterization ^ o ij). The curve 7 is parameterized proportionally 
to are-length when ||7(t)|| is constant for t G [ 0 , 1 ]. We want to estimate ^(7) 
from ordered m -I- 1 -tuples Q = (go,9i,92, ■ ■ ■ ,qm) G (IR"')™+^, where qi = 7(U), 
whose parameter values U G [ 0 , 1 ] are not known but sampled in some reason- 
ably regular way: sampling might be e-uniform for some 0 < e < 1. e-uniform 
sampling is invariant with respect to reparameterizations ip : [0, 1] — > [0, 1]. 
So suppose, without loss of generality, that 7 is parameterized proportionally to 
arc-length. 

We close this section with Figure 1 indicating why arbitrary sampling and 
piecewise-quadratic Lagrange interpolation (see Section 3 ) in most cases gives 
poor estimates for d{'y) (and indeed for 7). In Figure 1 only the uniform data 
yields reasonable approximations. In the next sections we show that some kinds 
of non-uniform sampling also give good approximations. 



3 Uniform Sampling 

We first consider length estimates of 7 in the easier case, where the U’s are sam- 
pled perfectly uniformly: U = ^ (with 0 < i < m). Suppose k = r-|- 2 , where r > 
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Fig. 1. Absolute errors E = \ t : — d(7)| for a unit semicircle approximated with 
the piecewise-quadratic interpolant 7: (a) For perfectly uniform sampling E = 
0.00362662. (b) For 0-uniform sampling (where = | for i even and ti = ^ ~ ^ 
for i odd; 0 < i < 6) E = 0.323189. (c) For some random sampling E = 0.22992. 
(d) For another random sampling E = 0.15394 



1, and (without loss of generality) that m is a multiple of r. Then Q gives ^ r+1- 
tuples of the form (go, <?i, • ■ • , Qr), {qr,qr+i, • ■ • , q 2 r), ■ ■ ;{q m—rt qm—r+lt • • • : qm^ • 
The j-th r + 1-tuple is interpolated by the r-degree Lagrange polynomial : 
[0,r] ^ M", here 1 < j < f: P^O) = P^l) = qu-iy+u- ■ .P^ir) = q,r- 

Note that P^ is defined in terms of a local variable s G [Ojc]. Recall Lemma 2.1 
of Part 1 of [13]: 

Lemma 1. Let f : [a,b] — > M" be O’", where I > 1 and assume that /(to) = Oj 
for some to G (a, b). Then there exists a function g : [a, b] K" such that 

f(t) = (t-to)h(t). 

The proof of Lemma 1 shows that 5 = O(^). If / has multiple zeros to < ti < 
. . . < tk then fc -h 1 applications of Lemma 1 give 

f{t) = {t- to){t - ti){t -t 2 ) ...{t- tk)h{t), (2) 

where h is and h = 0(7ps+r)- 

Assuming that 7 is (i.e. fc = r -|- 2) we are now going to prove The- 

orem 1, where estimation of ^(7) is based on piecewise-r-degree polynomial 
interpolation. For each j-th r-tuple consider the interpolating polynomial Pf . 
Let ip : [t(j-i)r,tjr] [Ojr] be the affine mapping given by V'(t(j-i)r) = 0 



344 Lyle Noakes et al. 



and 'ip(tjr) = f, namely ipit) = mt — (j — l)r. Thus ipit) is identically m (a 
diffeomorphism) . Note that since both intervals and [0,r] are uni- 

formly sampled, ip maps the tfs to the corresponding grid points in [0, r\. Define 
7 j = Pf o tp ^ BL". Then as if is affine, 7 ^ is a polynomial of 

degree at most r. Note that f = Jj — J ■ tjr] — > H" is and satisfies 

= /(to-i)r-+i) = • • • = fitjr) = 0. By (2) 

f{t) = {t — ■ ■ ■ {t — tjr)h{t), (3) 

where h : > H" by Lemma 1 is Still by proof of Lemma 1 

'■<'> = = O(^) = 0(1), (4) 

because degfjj) < r and jip+r is 0(1)- Thus by (3) and (4) 

m = o{^), ( 5 ) 

771’^+^ 

for t G trj\. Differentiating function h (defined as a r+ 1-multiple integral 

of over the compact cube [0, see proof of Lemma 1) yields 

m^oigi)-oCgi)-ou, (e, 

as deg{jj) < r. By (3) and (6) / = 0(;^) and hence for t G 

i{t) - 7j{t) = fit) = 0{^). (7) 

Let V^{t) be the orthogonal complement of the line spanned by j{t). Since 

II7WII = 



^jit) 



< ijit),iit) > 

dii)^ 



iit) + v{t), 



( 8 ) 



where v{t) is the orthogonal projection of 7j(t) onto V^{t). Since 7j (t) = f{t) -\- 
7 (t), wehave%(t) = (1+ ^^^d(f^jP^ )iit)+vit). Furthermore, by (7), v = 0(;^). 
Since < >= 0, by the Binomial Theorem the norm || 7 j(t)|| = 



II 7 WIU 1 + 2 



< fit),ijt) > 

dil)^ 



■0( — ) = ll7WII(l 



< fit),ijt) > 

dil)^ 



) (9) 



up to the 0{:^) term; note that by (7) <; holds 

asymptotically. Integrating by parts, II II - Il7(0ll) dt = 




< fit),'iit) > 
d{l) 



dt 0(- 



,2r+l 



) = - 



I. 






< f(t),7(t) > 
d{l) 



dt-\- 0{- 



m 



2r+l ^ 



Length Estimation for Curves with Different Samplings 345 



Since 7 is compact and at least by (4) and h = 0(1) we have < >= 

0(1) and < /i(t), 7 ^^^(t) >= 0(1). Similarly, by ( 6 ) we have < h(t),^{t) >= 
0(1). Hence, by (3) and Taylor’s Theorem applied to r{t) =< > at 

t = we get < >= {t - t(j_i)^)(t - t(j_i)^+i) ...{t- tjr){a + 

0(;j^)), where a is constant in t and 0(1). Since sampling is uniform the inte- 
-i)r){t — t{j-i)r+i) ■ ■ - {t — tjr)dt vanishes when r is even. So 
^ /W> 7 (^) > dt is either 0 (;;^) or according as r is 

odd or even. As 2r + 1 > r + 3 (for r > 2) and 2 r + 1 > r + 2 (for r > 1 ), 




(ll7,WII 



II 7 WII) dt 



0 (;T^) if r > 1 is odd 
0 (;T^) if r > 2 is even. 



Take 7 to be a track-sum of the jj, i.e. d{j) = ^d( 7 j) = d{'y) + 0{ ^}+p ), 

where p is 1 or 2 according as r is odd or even. This proves Theorem 1. 

Notice that, by (5), perfectly uniform sampling permits estimates of 7 with 
uniform 0( ) error. Next we consider non-uniform samplings, for which 

piecewise-quadratic interpolation gives good length estimates. 



4 e-Uniform Sampling 



Let fc = 4, so that 7 : [0, 1] — *■ IR" and its reparameterizations are at least C"*. 
Fix 0 < £: < 1, and let the ti’s be sampled e-uniformly. We are going to prove 
Theorem 2. Without loss of generality m is even. For each triple (q^, 9 ^+ 2 ), 

where 0 < f < m — 2, let Q* : [0,2]^ IR" be the quadratic curve (expressed in 
local parameter s S [0,2]) satisfying Q®(0) = qi, Q*(l) = qt+i, and Q*(2) = qt+ 2 - 
Write Q®(s) = qi + ais + , where s G [0, 2]. Then 

4pi+i - 3pi - g*+2 q^+2 - 2qz+i -h qi , . 

oo = qi, ai = and 02 = . (10) 

By Taylor’s Theorem 'y{tq) = jiU) + 'j{ti){tg - U) + (l/2)7(^,))(f, - UY, for 
either q = i + \ ov q = i + 2 and some ti < < tq. Combining the latter with 

l{U) = qi, 7 (ti+i) = qi+i, l{ti+ 2 ) = qi +2 and substituting into (10) yields 



02 — (l/2)7(ti)(ti+2 — 2ti+i -|- ti) + 0{ — j). 

Because sampling is e-uniform the Mean Value Theorem gives 



tq ti — 4> (riq) 



( 11 ) 



(12) 



for either q = i + 1 or q = i + 2 and some U < ijq < tq. Thus by (11) and (12) 

(13) 



ti+2 — 2 ti+l +ti.,. , 1 . 

02 = ;; 7(^i) + 0 {^). 



Furthermore 



ti+2 — ‘^ti+i + ti — (f>{- 



-)-<^(^)-( 0 (^)- 0 (-)) + O(^)(14) 

1 m mm m^^^ 
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because sampling is e-uniform. By Taylor’s Theorem the following holds 



</)( ) = </>(—) -k <^( — )— and </)( 

m m m m m 



^) = <^(-) + 0(-)-, (15) 

m mm 



up to a 0(;^) term. Substituting (15) into (14) and taking into account e- 
uniformity renders 



ti+2 — 2ti+i + ti — 0{ — yrr)- 

The latter combined with (13) and e-uniform sampling yields 



m 






m 



l+£' 



A similar argument results in 



Oi = 



4ti+i - 3L; - t 



i+2. 



7(^0 + O(-^) = O(-). 



(16) 



(17) 



(18) 



From (17) and (18), 



^ = a. + 2», = 0(l) and ^ = 2a, = 0(^), (19) 

where s G [0, 2]. Let ijj : [ti, ti+ 2 ] [0, 2] be the quadratic tlj{t) = bo + bit + 

satisfying tp{ti) = 0 , 'tp{ti+i) = 1 , and tlj{ti+ 2 ) = 2 (although ij) depends on i we 
suppress this in the notation). Inspection reveals 61 = {U+i — ti)~^ — b 2 {ti+i + 

ti), and 62 ((^ 2 -t-l ^i) (^Z + 2 ^ 2 -t-l)) [(^ 2+1 ^z)(^ 2 -t -2 ^ 2 -t-l )(^2 + 2 ^i)] 

Furthermore, as before, by e-uniformity (ti+i — ti) — {ti +2 — ti+i) = 0( ), 

and w?{ti+i — ti){ti +2 — ti+i){ti +2 — ti) = 0 ( 1 ), where the right-hand side of the 
latter is bounded away from 0 (as is a diffeomorphism defined over a compact 
set [0, 1]). Hence, 

62 = 0{m^~‘^) and 'ip{t) = 2^2 = 0 (m^“^). ( 20 ) 



As easily verified {ti+i — ti) ^ = 0{m). Hence, coupling bi = {ti+i — ti) ^ — 
62 (^ 2 +! + ti) with ( 20 ) yields 

ip{t) = bi + 2 & 2 t = 0(m) -I- & 2 ( 2 t - {ti+i + ti)) = 0(m), (21) 

as sampling is e-uniform and 2t — {ti+i + ti) = 0{^), for t G [ti,ti+ 2 ]- In 
particular, ip is a diffeomorphism for m large. Define 7 ^ = Q'^oip : [ti, ti+ 2 ] — *■ H"- 
Then % is polynomial of degree at most 4. Its derivatives of order 2 < p < 4, 
are 0 (mO-i)(i-®)). jj^jeed, by (19), (20), (21), deg{il)) < 2 and deg{Qp) < 2 

7 , = Q* + Q* ^ = 0{—^)0{m'^) + 0 (— ) 0 (m^"®) = 0 ( 771 ^"^^), (22) 



7p^ = 3Q* 'ip’ip = 0{ — \—)0{m)0{rn^ ^) = 0{rri^ ‘^^), (23) 



7 W = ^ 1 )o(^4-2e) ^ o(m3-3-). 

771-'-+^ 



( 24 ) 
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Then / = 7^ - 7 : [ti,ti+ 2 ] — > K" is and satisfies /{U) = /(ti+i) = /(ti+2) = 
0. By (22), (23), (24) and er-uniformity we have 

§ = § = 0^1-)). ^ = 0(™«->). (25) 

Use Lemma 1 to write f{t) = — — — where h : [ti,ti+ 2 ] 

M" is C^, respectively. Then again by Lemma 1 and (25) we have h = 0{'^) = 
0(m^(^“®)). Furthermore (6) coupled with (25) renders h = -'^)). The 

latter combined with the e-uniformity yields 

f=0{^) and f = 0{ / ). (26) 

As in the proof of Theorem 1 define (t) to be the orthogonal complement to 
the space spanned by j{t). Then expand Jj{t) according to (8), where v{t) is 
the orthogonal projection of 7j(t) onto tA)-(t). Similarly to (9), by using (26) we 
arrive at r; = 0(;^) and thus 



Il7,«)ll = Il7«)ll(i + ^ (27) 

for which we use e G (0,1]- Consequently, by (9), (26), and (27) the integral 

itr\\mm-\\m\)dt = 



< > 
d{j) 



dt + 0{- 



1 



m 



l+iE ' 



■/: 



< /(^): 7 (^) > 
d{l) 



dt 



uptoO(;7r_),Now< /(t),7(f) >= {t-ti){t-ti+i){t-ti+ 2 )r{t),wherer{t) =< 
>. Taylor’s Theorem applied to r at ti yields r{t) = r{ti) + {t — ti)r{^), 
for some U < ^ < ti+2- Similarly to the argument used for uniform sampling, 
r{ti) = and r = Consequently, by (16) — ti){t — 

ti+i){t - U+2)dt = ^(ti - ti+2f{ti+2 - 2 ti+i -h ti) = 0 (;^^) and hence the 
integral = 

[ {t - ti){t - - ti+2)dt + 

d[V Ju ^ 

is O(tj^). So again by er-uniformity //;+"(||7i(i)ll ~ 117(^)11) dt = + 

0 (;Titi?) = 0 (;tit 3 f), and we finally arrive at d{j) = = d(y) -h 

This proves Theorem 2. 

Notice that e-uniform sampling, permits (by (26)) estimates of 7 with uniform 
0 ( 7 ; ) error. Moreover, by taking e = 1 in Theorem 2 we obtain a stronger 

statement (as 0-perturbation of uniform sampling is still allowed; see Definition 
1) than Theorem 1 when r = 2. Note also that Theorem 2 has nothing to say 
about the case e = 0. This is dealt with in [15] using a different approach. 
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5 Experiments 

Next we test the sharpness of the theoretical results in Theorems 1 and 2 
with some numerical experiments. Our test curves are a semicircle and cu- 
bic 7s, 7c : [0,1] — > given by 7 s(t) = (cos(7t(1 - t)),sin(7r(l - t))) and 

7c(i) = Of course d{'js) = tt, and numerical integration gives 

d{jc) = 3.3452. Experiments were performed with Mathematica. 

5.1 Uniform Sampling 

We first discuss convergence of length estimates for piecewise polynomial approx- 
imations and perfectly uniform sampling. Experiments were conducted for both 
test curves, with r = 1, 2, 3, 4 for which Theorem 1 asserts errors that are 
O(^), 0{:^), and 0(;^), respectively. For each r = 1, 2, 3,4 the minimum and 
maximum number of interpolation points were (mini,maxi) = (7711712,7710x2) = 
(7,101), {min^,max^) = (7,100), and {min 4 ,max 4 ) = (9,101). Let jr,mr rep- 
resent a piecewise-r-degree polynomial interpolating nir points. In each row of 
Tables 1 and 2 (for a fixed 1 < r < 4) we list only some specific values obtained 
from the set of absolute errors = |d(7) — d(7r,mc)l (here indexes 

lr,mri where mirir < trir < maXr and nir = rn + 1), namely: £’““^''(7) = 
uiaxminr.<mr.<maxr. Emr-il) and EmaxAl) ■ Moreover, for each r, in searching 
for the estimate of convergence rate 0{:^) (where m -|- 1 = irir is a num- 
ber of interpolation points) a linear regression is carried out on pairs of points 
(log(mr — 1), — log(£mr(7))j where mirir < ir ^ maXr and ir = rn + 1. Here 
are the results. Both Tables 1 and 2 suggest that (in these cases at least) the 
statements in Theorem 1 are sharp. (The last two rows of Table 2 are somewhat 
irrelevant in that Lagrange interpolation returns, for r > 3, the same curve 7c, 
up to machine precision.) 

5.2 e-Uniform Sampling 

A full report on experiments with piecewise-quadratic Lagrange interpolation 
with e-uniform sampling is given in [16]. We experimented with e\ = 1, ei/2 = 
1/2, ei/3 = 1/3, and for I = 1,2,3, with diffeomorphisms (j)i : [0,1] — > [0,1], 



Table 1. Results for length estimation d(7s) of semicircle 7g. 



r 


mirir 


maXr 


a (where Emr oc m “) 


r-imaxr \ 

^mirir W ^ / 


Emaxr 


1 


7 


101 


1.99943 


0.0357641 


1.29191x10“'* 


2 


7 


101 


3.98485 


0.0036266 


5.09874x10“® 


3 


7 


100 


3.97964 


0.0026509 


3.98087 xl0“® 


4 


9 


101 


5.98218 


0.0001136 


3.19167 xl0““ 
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Table 2. Results for length estimation d{'js) of cubic curve 7c- 



r 


mirir 


maxr 


a (where Em^ oc m “) 


^maxc( ) 
mirir '' ' ' 


Emaxr 


1 


7 


101 


2.00006 


0.0357641 


5.18348x10"® 


2 


7 


201 


4.09546 


0.0036266 


1.22657x10"^^ 


3 


7 


100 


n/a“ 


5.90639 


4.44089x10"^®“ 


4 


9 


101 


n/a“ 


2.73115x10"^^“ 


4.44089x10"^®“ 



not applicable (see above). 



given by cj)i{t) = t, 4>2{t) = + 1), and These functions 

are used to define first e-uniform random sampling 

% 1 

U = 4>i{ — ) + {Random[ ] — 0.5) — r— , (28) 

where Random[ ] takes the pseudorandom values from the interval [0, 1] and 
0 < i < m. In addition, we experimented with two other families of skew- 
symmetric e-uniform samplings with 4>i and 0 < * < m: 

— if i even, 

m ’ 

m + M I = Ak + I, (29) 

^ if f = 4fc + 3. 

In all cases to = 0,ti = 1. Piecewise-quadratic interpolation was implemented 
for both kinds of sampling, with m even running from m = 6 up to m = 100 
and to m = 200, respectively. These experiments with 7^ and 7c showed faster 
convergence than proved in Theorem 2 for sampling (28). However, the statement 
of Theorem 2 appears to be sharp for the samplings (29) : the observed rates of 
convergence nearly coincide with those asserted by the theorem: a\ = 4, ai/2 = 2 
and ai/3 = 4/3. Note also that for 0-uniform sampling (29) (i), and for semicircle 
7s and cubic curve 7c, a piecewise-quadratic Lagrange polynomial interpolant 
does not provide good estimates of d{'js) and d{^c), respectively (see Figure 2). 

6 Conclusion 

The problem of estimating ^(7) of a curve seems rather straightforward when 
the parameter values € [0, 1] are given, for example when sampling is uniform. 
This paper examines a class of samplings for which the same simple methods 
give length estimates converging to the true value ^(7), including investigation 
of convergence rates. Our results appear to be sharp for the class of samplings 
studied in this paper. Piecewise Lagrange interpolation does not work well for 0- 
uniform samplings (more elaborate methods for dealing with these are given 
in [15]) and so the class of e-uniform samplings is of special interest where 



(*) k = - + 



(- 1 ) 



2+1 



m 2mi+^ 



{ii) ti = 
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Fig. 2. Absolute errors plotted for 0-uniform skew sampling (29) (i) (where m2 
is even and 6 < m2 < 200): (a) Emails) against m2, (b) against m2 



0 < e < 1 . In general, the relationship between convergence of length estimates 
and uniform convergence to the image of 7 seems not quite straightforward. 
Because the methods used herein are relatively simple, they are also widely 
applicable. Unlike the situation in [15] there is no convexity requirement on 7, 
and there is no need to restrict attention to planar curves. 
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Abstract. The 1-D Leap-Frog Algorithm [12] is an iterative scheme for 
solving a class of nonlinear optimization problems. In the present paper^ 
we adapt Leap-Frog to solve an optimization problem in computer vision. 
The vision problem in the present paper is to recover (as far as possible) 
an integrable vector field (over an orthogonal grid) from a field corrupted 
by noise or the effects of digitization of camera images. More generally, we 
are dealing with integration of discrete vector helds, where every vector 
represents a surface normal at a grid position within a regular orthogonal 
grid of size N x N. Our 2-D extension of Leap-Frog is a scheme which 
we prove converges linearly to the optimal estimate. 1-D Leap-Frog [12] 
can deal with nonlinear problems such as are encountered in computer 
vision. In the present paper we exploit Leap-Frog’s capacity to handle 
large number of variables for a linear problem (in this situation Leap-Frog 
becomes an extension of Gauss-Seidel) , and we offer a geometrical proof 
of convergence for the case of photometric stereo (see e.g. [10,11]), where 
data is corrupted by noise or digitization. In the present paper, noise 
enters in an especially simple way, as Gaussian noise added to gradient 
estimates. So this as a first step towards more realistic (and demanding) 
applications, where Leap-Frog’s capacity to deal with nonlinearities is 
needed. The present paper also offers an alternative to other methods in 
photometric stereo [3], [6], [13], and [15]. The performance of 2-D Leap- 
Frog was demonstrated in [14] without proof of convergence: established 
methods are faster, but without Leap-Frog’s capacity for generalization 
to nonlinear problems. 



1 Introduction 

A monochrome picture of a smooth object typically exhibits brightness variation 
or shading. Shape- from- shading is the problem of determining the visible part 
of the object from the picture. As shown in [5] (see also [7]), this corresponds 
to solving a nonlinear first-order partial differential equation. Specifically, one 
seeks a function u, representing surface depth in the direction of the z-axis, 
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satisfying an image irradiance equation. For a a Lambertian surface (a perfect 
light diffuser) with constant albedo, illuminated from the light-source direction 
{pi,P 2 ,P 3 ), the image irradiance equation is 



Piu^jx, y) -h P 2 Uy{x, y) - p-j 



\JpI+pI+ pI^uHx, y) + ul(x, y) + l 



= E{x,y) 



( 1 ) 



over an image H C IR^. Here E is an image intensity formed by orthographic 
projection onto the plane of the image The unknown surface S is the graph 
of the function u, which is to be determined up to translations u u + c. A 
single image shape-from-shading problem (with no additional constraints) is, in 
general, ill-posed in the sense that not enough data is given for the problem (1) 
to be uniquely solved. For background we refer to [-5], [7] or [9]. 

In multiple light-source shape from shading the situation is simpler in some 
respects, namely a Lambertian surface is generically determined up to translation 
by multiple images, at least over the intersection of their respective domains (see 
[5, subsection 10.16] or [10,11]). Shape reconstruction, as in multiple light-source 
photometric stereo, can be decomposed into: gradient computation (an algebraic 
step) and gradient integration (an analytic step). What complicates the problem, 
especially when images are subject to noise or digitization, is that the tableaux 
of computed values v^,v'^ of Ux,Uy usually do not correspond to a function u. 
To see why not suppose that u is C^. Then Uxy{x, y) = Uyx{x, y). It follows that 
a necessary condition for v = , v^) to correspond to a function is Vy = Vx- 

When f? is simply-connected this integrability condition is also sufficient for the 
computed values of v^,v'^ to correspond to an image. From now on take to 
be the unit square [0, 1] x [0, 1], and consider the problem of correcting v^) 
to an integrable field before passing to gradient integration. This second step is 
easy, and then u is determined up to translation. 

A natural way to correct v is to choose the closest integrable vector field, 
depending on what is meant by “close” (here we use the standard definition). 
This, if attempted directly, results in a large-scale linear algebra, which is in- 
tractible by elementary methods. By contrast, each step of 2-D Leap-Frog is 
a small-scale optimization problem, and the method consists of blending these 
elementary tasks by means of an iterative scheme. 

Known techniques for handling the problem include [5, subsections 11.7-11.8] 
and [6] which minimizes different best fit functionals. One of Horn’s variants min- 
imizes the following functional fj^((ux(x, y)—v^(x, y))^-\-(uy(x, y)—v^(x, y))^)df2 
whose Euler-Lagrange equation is a Poisson equation Art = /. A natural bound- 
ary condition makes the corresponding problem well-posed. Fast methods of 
solving Poisson’s equation exploiting the linearity of Art = / can be then be 
used [2] or [15]. A different minimization scheme was introduced in [3], using 
projection to the closest function u expanded as a Fourier series. Another alter- 
native is the Lawn-Mowing Algorithm [13], which gives suboptimal solutions. 

Theorem 1 of the present paper asserts linear convergence of 2-D Leap-Frog 
to the optimal solution estimate (D^, v^). There is no claim that our method com- 
petes for the speed with established methods based on Poisson’s equation, but 
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our geometrical proof seems interesting in its own right, and the construction is 
capable of considerable generalization [8], [12]. Our proof also seems more acces- 
sible than known algebraic proofs for classical iterative schemes for large linear 
systems. In the present linear setting 2-D Leap-Frog can be seen as a subclass 
of such methods, but 1-D Leap-Frog solves nonlinear optimization problems in 
a geometrical setting. So extensions of 2-D Leap-Frog to the nonlinear case are 
likely to be useful for nonlinear optimization problems arising in computer vision. 
In fact a number of further nonlinear applications are planned, to demonstrate 
the generality of this technique. 

Although we focus here on rectifying nonintegrable vector fields in the con- 
text of noisy or digitized photometric stereo images, similar techniques can also 
be applied to digitized or noisy optical flow or in physics where the unknown 
potenial u gives rise to the observable vector fields. 

2 The 2-D Leap-Frog Global Optimizer 

For n > m let L : M" ^ be a linear transformation of maximal rank m. 
Given L{v) ^ u, for some v G IR" and u G M™, consider the task of finding the 
closest V G K" to u S K” such that L(v) = u. It is well-known that jju — ull 
is minimized by the orthogonal projection of v on the affine subspace = 
Vp + Ker{L) C IR", where Ker{L) = span{fi, / 2 , . . . , fk\ is the kernel of L 
and Vp is a particular solution to Lix) = u (here k = n — m). In order to actually 
And V we need to compute oti using the following k x fc-linear system: < 

filfj >=< V — Vs\fj >, in principle solvable by the Gaussian elimination. 
When k is large and the matrix of L has special properties, Gauss-Seidel or, 
more generally, multiplicative Schwarz (see e.g. [1]) may be more appropriate. 
Alterntively one can use the pseudoinverse Lps = {LLA')~^ : — > M" for 

which 



it = V — {LL^) ^{L{v) — u). (2) 

As with ordinary inverses it is not always convenient to use the pseudoinverse. 
Direct or iterative methods of solution may be more efficient. From this point 
of view, 2-D Leap-Frog is an iterative scheme, particularly well suited to photo- 
metric stereo, and with links to other problems of interest. 

For many purposes the problem of approximating a noisy or digitized vector 
held by an integrable held can be reduced to linear algebra, by transforming 
the integrability condition into its discrete analogue. In more detail, for fixed 
k < I G IN divide the unit square SI into 2^* grid cells of the form = 

[{ig - l)/2\ig/2‘] X [(jg - I)/2',jg/2'j (for I < ig,jg < 2'). Of course, grid 
resolution is here 2^ From values of u on such a grid we could calculate central- 

difference derivative approximations with Ax = Ay = 1/2* Ux[i,j] ~ — ~ 

+ 1 

and Uy[i^j] « * for each side of a subsquare in the x and y directions, 

accordingly whether the side is horizontal or vertical. Along each grid cell, the 
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analytic integrability condition (modulo truncation error assumed here to be 
dominated by noise or digitization) translates into the discrete analogue 

Ux[i,j + 1] - Ux[i,j] = Uy[i+ l,j] - Uy[i,j]. (3) 

Assume that either a uniform Gaussian noise with mean zero is added to 
each j] and each Uy[i,j] independently, or alternatively that noise is gener- 
ated by digitization (the specific value of standard deviation shall be set later; 
see the closing section). The problem now is to estimate {ux, Uy) from 2*+^(2*-|-l) 
noise contaminated differences in such a way as to be closest to the discrete non- 
integrable field v = {v^,v'^). Accordingly, i) = {ux^Uy) should minimize the sum 
of squared residuals 

{{ux[i,j]-v^[i,j]f + {uy[i,j]-v^[i,j]f) . (4) 

Such a solution is said to be optimal, and finding it leads to a large system of 
linear equations in many unknowns: the need for some sort of iterative scheme 
is quite apparent. 

We now confine attention to a given S^j, with 1 < i,j < 2*+^“^ — 1 (de- 
noted also by St) having 2^^ grid cells, and review various optimization prob- 
lems considered over St subject to nine different boundary conditions, namely: 
top-right, left-top-right, left-top, top-right-bottom, top-right-bottom-left, bottom- 
left-top, right-bottom, right-bottom-left, and bottom-left. These are referred to in 
the description of 2-D Leap-Frog in subsection 2.1 (see also Figure 1). Note that 
1 < LJ < 2'+^-'= - 1 as we consider here only half overlaps. 

In doing so, we assign to nonintegrable vector field Vh = {v^[i,j],v'^[i,j]) 
(over each St) 2^+^ (2^ -|- 1) free variables corresponding to the unknown 
corrected values of the closest integrable vector field Vh- More specifically, 
over each St, define 2^'(2^ -|- 1) free variables (corresponding to 
namely: X\ = n^[0, 0], . . . , — 1, 0], a: 2 fc+i = w^[0, 1], . . . , a; 2 fc+i = 

— 1,1], and a: 22 fc+i = r;^[0, 2^'], . . . , a: 2 fc( 2 fc+i) = v^[2^ — 1,2^]. Analo- 
gously, we have 2^(2^ -I- 1) variables (corresponding to v'^[i,j]), namely: y\ = 
z;2[0,0], ...,y2^ = 'u2[0,2'=-l],y2^+i = . . . ,y2k+i = -l],y22h+i = 

z;^[2^, 0], . . . , y 2 fc( 2 fc-i-i) = i'^[2^,2^ — 1]. Assuming temporarily, that k = I (3) 
applied to each grid cell yields a homogeneous optimization system of 2^* lin- 
ear equations in 2*+^(2* -|- 1) unknowns L^{vz) = 0 , where 0 S and 

G with L^ivh) yf 0 (note that for / = fc we have = 0 as 

defined previously in this in section). Direct methods for solving such a global 
optimization problem (for which L^{vh) = 0) constitute, for / > 7, an unwieldly 
computational task. So further discussion focuses on local optimizations over 
each St (with k < 1) and on melding local optima into a global optimum (the 
2-D Leap-Frog). Note that as simple verification shows has maximal ranki.e. 
rank(L^) = 2^* and hence dim{ker{Vf)) = 2*+^(2* -|- 1) — 2^* = 2^(2* -|- 2). This 
is the dimension of the space of integrable vector fields. 
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As the analysis of the nine cases of different boundary constraints imposed 
on St for the 2-D Leap-Frog is similar we discuss here only top-right bound- 
ary conditions. Assume that boundary values are given, i.e. a:22fc+i = 

x° 2 k_^i,X 22 k _^2 = = ^2^2'^ -K) (^Presenting top bound- 

ary conditions for St) and 2/2=*^+! = y%+i, 2/22fc+2 = U%+ 2 ^ • ■ • , 2/2'=(2*^-i-i) = 
2/2fc(2ic_|_i) (representing right boundary conditions for St). Applying 2^^ integra- 
bility constraints (3) (along each grid cell) we arrive at an inhomogeneous system 
of 2^^ linear equations in 2^+^ (2^ -|- 1) — 2^+^ = 2^^+^ unknowns. Using notation 
from section 2 this system can be treated as L)^{xtr) = Utr (for L*^{vtr) 7^ Utr), 
with Xtrj Vtr € IR , Utr G BL and L" : IR ^ IR being a linear oper- 
ator. Note that for n = 2^^+^ and m = 2^^ condition n > m holds. It is straight- 
forward to show that Rank{L)^) = m. As L)^{vtr) yf Utr, the closest vector i)tr 
satisfying L)^{xtr) = Utr can be found as in section 2. Note that as k increases 
the dimensions of both linear spaces IR™ and IR" grow exponentially. The origi- 
nal problem (4) is reduced to a collection of computationally tractable problems, 
where I is replaced by 1 < fc < /. The operators ^ 

and L^)} are determined similarly. 

2.1 2-D Leap-Frog 

Suppose given a nonintegrable discrete vector field v over a rectangular grid in 
17 = [0,1] X [0,1] of grid size 1/2^ The optimization problem (4) is solvable in 
principle by the methods of section 2, but in practice this is unworkable by direct 
methods unless I is small. Certainly I > 7 makes diffhculties. So the following 
algorithm uses a sequence of optimizations where I is replaced by a smaller 
integer k. 

Cover 17 by a familiy of overlapping subsquares = {5^^- }i<ij<2'-‘^+i-i 
each comprising 2^^ grid cells Each S^j is defined as follows S^j = [(* — 

1)2'=-'-!, (i - 1)2'=-'-! -k 2'=-'] X [(j - 1)2'=-'-!, (j - 1)2'=-'-! -k where 

1 ^ j < 2^^+! _ X, The case when fc = / — 1 (and I > 2 is arbitrary) is shown 
in Figure 1. Let be any integrable vector field. Many choices are possible. For 
example we might use the methods introduced in [3], [6] or [13] to approximate 
V. The easiest (and perhaps best) choice is = 0. We present Leap-Frog with 
half Sfj overlaps, but Theorem 1 can be proved for overlaps between 1 and 2^ — 1 
pixels, using the same argument. 

For n = 1,2,... repeat the following sequence of steps until some halting con- 
dition is flagged (e.g. bounds imposed on the number of iterations, on estimated 
error, or on deficiency angle as in specified in section 4). 

— start with the left bottom subsquare Si[ and apply a least-square optimiza- 
tion with respect to^ v, with fixed top-right boundary constraints inherited 
from the values computed in the n — 1th iteration (for k = I — 1 see Figure 1 
(a)). 

^ More precisely, adjust the variables in the snapshot to minimize the distance to the 
given nonintegrable field v. 
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Fig. 1. Covering an image C by the family subsquares 

(here fc = / — 1). Each consists of grid cells 



— pass now to the second subsquare of the first row S 21 and optimize with 
fixed left-top-right boundary constraints. The left and left-half top boundary 
conditions are inherited from computed values over Si[ obtained in the nth 
iteration. The right and right-half top boundary conditions are inherited 
from the computed values in obtained n — 1th iteration (for k = I — 1 see 
Figure 1 (b)). 

— continue until the last subsquare row. Over this 

subsquare optimize with left-top boundary constraints fixed. The left and 
left-half top boundary conditions are given by the previously computed val- 
ues over Sj^ 2 ‘-k+i_ 2 )i nth iteration. The right-half top boundary conditions 
are given by the values obtained in the n — 1th iteration (for k = I — I see 
Figure 1 (c)). Thus the first row of nth iteration is completed. 

— next pass to the second row. Start with the 5'^2 subsquare and optimize with 
fixed top-right-bottom boundary constraints. The bottom and bottom-half 
right boundary constraints are inherited from nth iteration values computed 
over subsquares Si{, S 21 and S^{. The top and top-half right boundary 
conditions are inherited from values computed in the n — 1th iteration (for 
k = I — 1 see Figure 1 (d)). 

— pass to the second subsquare S 22 over which we optimize with top-right- 

bottom-left boundary constraints (a generic case). The left-half top, the 
left, the bottom and the bottom-half right boundary conditions are inherited 
from the values of the nth iteration computed over and S^[. The 

remaining boundary conditions are inherited from computed values obtained 
in the n— 1th iteration (for fc = / — 1 see Figure 1 (e); note that in case when 
k = I — 1 there is no dependence on ^ 41 -boundary) . 

— continue until the last subsquare S^^i_k+i_^-j 2 second row is reached. 

Over this subsquare optimize with bottom-left-top boundary constraints 
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fixed. The left-half top, the left, and the bottom boundary conditions are 
inherited from the values computed in the nth iteration over S^ 2 i-k+i_ 2)2 
and The right-half top boundary conditions are inherited from 

values computed in the n — 1th iteration. Thus the second row of the nth 
iteration is completed (for k = I — 1 see Figure 1 (f)). 

— continue row by row (as specified in the previous step), until the last 

row is reached. Now optimize over S^ 2 i~k+i_i with fixed right and bot- 
tom boundary constraints. The bottom and the bottom-half right bound- 
ary conditions are inherited from values computed in the nth iteration 
over and The top-half right boundary 

conditions are inherited from values computed in the n — 1th iteration (for 
k = I — 1 see Figure 1 (g)). 

— pass to the second subsquare of the last row S 22 i-k+i_i over which we op- 

timize with right-bottom-left boundary constraints fixed. The bottom-half 
right, the bottom and the left boundary conditions are inherited from values 
of the nth iteration computed over S^^i_k+i _2 and S^ 2 i-k+i_ 2 - 

The top-half right boundary conditions are inherited from computed values 
obtained in the n — 1th iteration (for k = I — 1 see Figure 1 (h); note that 
when k = I — 1 there is no dependence on 5'^2i-fc+i_2“t>oundary). 

— this process continues, up until the last subsquare 

in the last row is reached. Over this subsquare optimize with the 
bottom-left boundary constraints fixed and inherited from computed val- 
ues over 5'("2i-fc+i-2)(2i-''+i-i) 5'|'2i-fc+i_i)(2i-fc+i-2) iteration 

(for k = I — 1 see Figure 1 (i)). This completes the nth iteration, and the 
resulting integrable vector field is labelled 

Remark 1. During the noth 2-D Leap-Frog iteration once the mth Sm snapshot 
is reached we update free variables according to one of 9 different boundary 
constraints. Let v^_i G ^ be the current estimate of h e ^ 

obtained upon completion of local minimization over Sm-i within noth iteration. 
Let represent currently updated and fixed boundary values of Sm (all or 
some of them) taken from ^ according to one of the nine admissible rules. 
As previously the integrability condition over Sm reads Lm{X) = (there 
are 9 different operators Lm)- Assume v^^^m) defines the vector which instead 
of free variables (over a snapshot Sm) inherits the corresponding values from v. 
Leap-Frog solves the local optimization problem by 

K(m) = - L;i(Lm(v^(m)) ~ «”“ ) , (5) 

where L™ = L'^{LmL'^)~^, and corrects a generic local misfit Lmiv^i^m)) ^ 
to Lm{vl°m)) = , minimizing \\vl°m)~'"^{m)\\ over all integrable vector fields 

over Sm (subject to relevant boundary constraints). For m > 1, Leap-Frog up- 
dates to v'^ by replacing variables in v'^_i (corresponding to free variables 
in Sm) with v^°m)- Alternatively, 2-D Leap-Frog, can be viewed as successive or- 
thogonal projections of {z^ arises by replacing entries in with those in 
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V coinciding with free variables in Sm) on some affine subspaces A!^ C Ker{L^) 

Q I 1 Cq Z I -1 \ 

of integrable vector fields in IR . 



3 Convergence of 2-D Leap-Frog 

Here is our main result: 

Theorem 1. Let v be the optimal integrable vector field for the problem (4) 
described in section 2 and h” its estimate upon completion of nth iteration of 
2-D Leap-Frog. Then for some constant C € [0, 1) and for each n £ IN 

\\v-+^-^<C\\v--^. (6) 

Preliminaries: To prove Theorem 1 (but not to implement 2-D Leap-Frog) it 
is necessary to change the way we look at the h". Rather than regard these as 
vectors in IR ^ , it is better to view the v as abstract vectors or tableaux: 

a tableau is an array of real numbers of the form 

\ 



Vt = 



/ 


X22i+i 




X2i(2'+1) 




V 2 l 


o 


2/22' 


2 / 22 ' 0 


2/2'(2'-|-1) 




X221-1+1 




X221 






X2i + i 




X22' 




yi 


0 


2/2'+1 


?/ 22 '- 1-|-1 0 


2/22' +1 


\ 


Xl 




X2I 





(7) 



/ 

Even without the o symbols, a tableau is not a matrix (neither rows nor columns 
have the same number of entries). A o symbol in a tableau is called a nucleus 
and its adjacent real entries are its neighbours. Given I, the set of all tableaux 
is denoted by Vr. 



For vix: V 2 T G Vt define vix +t V2t by + Xs^'^ and = 



ys 



(where 1 < s < 2^(2* -|- 1)). For vt & Vt and y € IR, define wt 



fi -T Vt by = yXg'^ and (here similarly 1 < s < 2^(2* -|- 

1)). With respect to these operations of vector addition and scalar multiplica- 
tion, (Vt,+t, ’t) is a vector space, equipped with a natural isomorphism I to 

5^2 + (2 -K) J J[vt) = (xi,X2, . . ■ ,X2i(2'+i):2/i:2/2, ■ • ■ , J/ 2 ' ( 2 *-i-i) ) • Define an in- 
ner product < • I • >rp on Vt by multiplying corresponding real entries in two 
tableaux and then summing. Denote the associated norm by || • ||t- Then X is an 
isometry to Euclidean space IR ^ L A tableau (7) is said to be integrable 
when, for each nucleus, the weighted sum of its four neighbours vanishes. More 
specifically, for an integrable tableau: 



^(m— l)2Gn 4” yn2^+m ^rri2^+n y{n—l)2^+m 



(8) 



where 1 < m, n < 2b The integrable tableaux comprise a vector subspace V® of 
the vector space Vr of all tableaux. (8) coupled with dim{ker{L^)) = 2^(2* -|- 2) 
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yields dim{V®) = 2^(2* + 2). We shall prove now that one interation of 2-D 
Leap-Frog can be viewed as an affine mapping At : V® — > V® . Indeed 

Lemma 1. There is an affine transformation At : V® — > V® such that 

= AtVt, (9) 

where Vt = (for fc € IN^ denotes the representation (via I) of in the 

space Vt- The linear part of At (i.e. A^f) is constant over each iteration. 

Proof: With half-snapshot overlaps, one iteration of 2-D Leap-Frog consists of 
(2*+^-^ _ ^^2 snapshot optimizers. From formula (2) we infer that each snapshot 
optimizer defines an afhne map V® V®. Indeed (2) says v is an orthogo- 
nal projection of Vh (see section 2). The term u in (2) is linear in the inte- 
grable vector field. So the right-hand side of (5) is affine on integrable vector 
fields (depending on the position of the snapshot). Thus on each iteration 2-D 
Leap-Frog defines an afhne transformation (as a composite of Nq afhne map- 
pings) A({;") = A^(y^) -I- from the subspace of integrable vector helds to 
itself; A^ forms its linear component. Note that from (5) it is clear that only the 
linear component for each iteration remains unchanged. Hence the transforma- 
tion At{vt) = ^(^(^~^('^t))) = ^t('^t') + where Af^ = I o A^ oX~^ and 
X(rh") = Wt, yields (9). 

We shall now choose a special basis in the space V® . In doing so let Wm be 
the vector space of real (2* -|- 1) x (2* -|- 1) matrices. Think of the entries {oy} 
of such a matrix as values of the function u calculated at grid points labelled 
(i,j), where 1 < i,j < 2* -|- 1. Dehne a linear transformation <P : Wm — > Vt 
by mapping from function values to the corresponding discretised vector held, 
namely X = {aij)i<ij<n (where n = 2* -|- 1) and 



Y = 



ail — 021 



Oi2 — Oil 



022 — 021 



Ol2 — 022 



0(n-l)2 ~ 0(„_i)i 

0(n— 1)1 Oyji <> 0^^_ij2 a^2 

Ora2 — Oral 



Oira — 02ra 



O^ra— l)n Orara 






Notice that <P maps into the space of integrable tableaux, namely <P{Wm) C 
V® C Vt (see (8)). As easily verihed the kernel of T> is spanned by the constant 
matrix whose real entries are everywhere aij = 1 (for 1 < i, j < n). Consequently 
has rank (2* -|- 1)^ — 1 = 2^(2* -|- 2). As showed before this is the dimension 
of V®. So the image under of any (2* -|- 1)^ — 1 linearly independent matrices 
from Wm (which do not belong to ker{d>)) constitutes a basis of V® . In particu- 
lar, dehne = {h%h% . • . , h %, . . . , h®„, . . . , h® , h® 2 > ■ • ■ : ^®(n-i)}’ 

where h® = <T{Xn) (matrix Xu has Orai = 1 and zeros elsewhere), h ®2 = <P{Xi 2 ) 
(matrix A 12 has Qn 2 = 1 and zeros elsewhere),..., h®^ = ^(Xira) (matrix Aira 
has Orara = 1 and zeros elsewhere), h® = ^(^ 21 ) (matrix A 21 has 0(ra-i)i = 1 
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and zeros elsewhere), h ®2 = ^(-^ 22 ) (martix X 22 has 0 („_i )2 = 1 and zeros else- 
where),..., = <P{X 2 n) (martix X 2 n has = 1 and zeros elsewhere),..., 

h®i = <l>{Xni) (matrix Xni has an = 1 and zeros elsewhere), h ®2 = ^{Xn 2 ) (ma- 
trix Xn 2 has ai 2 = 1 and zeros elsewhere),..., and finally = ^(-^n(n-i)) 

(matrix X^^n-i) has ai(„_i) = 1 and zeros elsewhere). Then is a basis of the 
space V® of integrable tableaux. Let vt G Vt be a nonintegrable tableau corre- 
sponding to the noisy or digitized data for problem (4), and define for xt G V® 
the function / : IR'’ ^ IR (here s = 2*(2* -|- 2)), called performance index such 
that, for n = 2* -I- 1, we have 

/(a^) = ll = fixr), (10) 

(i,j)7S(n,n) 

where || • ||t is computed from the inner product < • | • >t on Vt and Xij G IR. 
Then Xo G IR^ is a critical point of / when, for x = X~^{xt) and v = 

f{xT) = \\At - VtWI = \\I-\xt) - X-\vT)r = II* - ttf ( 11 ) 

attains its (global) minimum at xto- Note that xto is an orthogonal projection 
i>T of Vt on V® . The importance of B® for 2-D Leap-Frog is due to the following 
lemma. 

Lemma 2. The vector xt G V® is not optimal if and only if 2-D Leap-Frog 
improves the performance index (10). 

Proof. No snapshot step of 2-D Leap-Frog increases /. To see this, note that once 
a given snapshot Sfj is reached, 2-D Leap-Frog improves generically internal 
values of while keeping the others fixed. In a rare situation if the boundary 
values coincide with those corresponding to Xo (and thus xto) no change is 
made. Hence (4) cannot be globally increased. Evidently 2-D Leap-Frog cannot 
improve the global optimum Xo- So 2-D Leap-Frog fixes Xq. On the other hand, 
if X (and thus xt) is not already optimal, the gradient of / at the corresponding 
point in T G IR^ is nonzero. Let (a, b) (here 1 < a, 6 < n and (a, b) yf (n, n), 
n = 2* -I- 1) be the first indices when (df/dxab)(v) yf 0 and when reached by 
step (* 0 ; jo) in 2-D Leap-Frog, where the corresponding basis element is in 
the interior of the snapshot (the existence of is justified in the closing 
section). Since / can be decreased by changing only one variable Vab, optimizing 
the snapshot gives a better performance index, which feeds through to the global 
tableau. 

We are ready now to complete the proof of Theorem 1 . Note that combining 
Lemma 1 and (10) with At(vt) = (see Lemma 2) yields \\xF®^ — ■wll = 
||*T+'-*t||t= ||AT(&?^)-fiT||T = 

\\Aif{x-T - vt)\\t < WA^WtWxt - vtWt = CP" - fill, 

where C = H^tII't = ■S''^T||q;t||=iII^t(®t)||t- R suffices to show C < 1 (it is 
essential here that the Aif is the same over each 2-D Leap-Frog iteration). By 
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Lemma 2 if x ^ v then \\At{xt) — vt\\t < \\xt — vt\\t, and by Pythagoras’ 
Theorem 

\\At{xt) — vt\\t < \\xt — vt\\t- (12) 

Let now St{vt, 1) be the unit sphere in the space V® of integrable tableaux cen- 
tered on the optimum vt- Defining g : St{vt, 1) — *■ IR as g{xTs) = W^ri^TS ~ 
■wt)||t and taking into account (12) we get for xts & St{vt, 1) (of course here 
xts ^ Vt) g{xTs) = \\^t{xts) ~ vt\\t < \\xts — vt\\t = 1- Note that g is 
defined on a compact set St{vtA) (as dim{V®) = 2^(2* -|- 2) < oo) and thus 
attains its supremum for some x*rs € St{vt, 1) which with g{xTs) < 1 yields 
a = sup{g{xTs) ■ Xts G <S'r('hT,l)} = g{x*Ts) < 1- The latter combined with 
(T = II Ay II T guarantees C < 1. Theorem 1 is proved. 

Of course, by (6) we infer ||t;"+^ — v\\ < C'"||?;^ — hH and thus as C < 1 is 
independent from n the convergence — > h holds. 

4 Experimentation and Conclusions 

We briefly describe the performance of 2-D Leap-Frog. Tests were run in Math- 
ematica.^ We took fc = 4 and I = 7: each subsquare consists of 16 x 16 pixel 
resolution and an image 17 consists of 128 x 128 pixels. In generating v, Gaussian 
noise was added, with mean zero and standard deviation 0.04, to the integrable 
gradient field Vgrad = (ux, Uy) obtained from the function Ui : [0, 1] x [0, 1] — > IR 
(for i = 1,2) defined as u\{x,y) = x'^ + 3xy -I- 2y^ {quadratic polynomial) 
and U 2 {x,y) = cos(20((a; — 0.5)^ -I- 2{y — 0.3)^)) {wavy function). Alternatively 
digitization of camera images can be performed, prior to computation of v. 2- 
D Leap-Frog was then applied with an a priori iteration bound set to no = 21. 
Lawn-Mowing [13] was used in the first iteration and then followed by no — 1 = 20 
2-D Leap-Frog iterations. For evaluation we introduce some auxiliary notions. 
Given the estimate of i) obtained after 21 iterations, define e = \\v — Vgrad\\'^, 
c = ||n — IP, d = \\vgrad — v^° IP- Pythagoras’ Theorem applied to the triangle 
A{v, v^°, Vgrad) yields the cosine of the angle between sides vv^° and VgradV^° 
as: cos(a) = Gonvergence of h" to v amounts to lim„^oo cos(a„) = 0. 

Denoting /3n = § — arccos(a„) as the angle deficiency, expressed in radians, 
upon completion of nth iteration of the 2-D Leap-Frog (/3„ should be close to 
zero) the following experimental results (each five-multiple iteration onwards) 
were obtained^: 

We close this section with some observations. 

— clearly k cannot be too big. Thus no iterations of the 2-D Leap-Frog take 
approximately (for large I, k < kmax fixed and I — k > 2) X = no (2*+^“^ — 
l) 2 ^ 22 (*-fe) ~ 4 j-jq times longer than Lawn-Mowing [13]. If however, I is small 

® Mathematica runs very slowly and further experimentation is needed with C in order 
to make comparisons with other methods. 

The (very high quality) ideal and reconstructed surfaces (with and without noise 
rectification) can be viewed at http : //www. cs .uwa. edu. au/~ ryszard/leap/pics/. 
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Table 1. Performance of 2-D Leap-Frog for iti and U 2 



Iteration number 


n = 1“ 


n = 2 


n = 6 


n = 11 


n = 16 


JlQ = 21 


d/ for Ml 


0.04256 


0.00298 


0.00050 


0.00028 


0.00018 


0.00013 


/3^ for U 2 


0.03786 


0.00286 


0.00081 


0.00033 


0.00018 


0.00011 



^ First iteration was Lawn-Mowing, but could be zero. 



and k is comparable with I i.e. when I — k = l or I — k = 2 then A = 9/4 or 
A = 49/16, respectively. In general, A = 4 — -|- 

— the removable noise or digitization errors are those from the orthogonal com- 
plement V® to the space of integrable tableaux. The unremovable propor- 
tion is approximately equal to {dim{V®)) / dim{VT)) = (2*(2*-|-2))/(2*+^(2*-|- 
1)) = i(l -I- 2 ^) « 0.5 (for I large). 

— 2-D Leap-Frog is related to classical algebraic methods in numerical analysis, 
as well as more recent work in nonlinear optimization [12]. In the special 
case k = 1 2-D Leap-Frog is Gauss-Seidel for a system with respect to our 
canonical basis. Alternatively Gauss-Seidel could be applied directly to the 
larger system (2) with respect to the standard Euclidean basis. For k > 1 is 
multiplicative Schwarz (see e.g. [1]). See also [4] for other methods for sparse 
linear systems. 

— in the proof of Theorem 1 a single coefficient variation Xab associated with 
changes only 4 points (or 3 or 2 if the corresponding snapshot forms an 
12-boundary snapshot) positioned inside (as fc > 1, a minimal snapshot 
size consists of 2x2 pixel block) . This is the reason for the use of our canonical 
basis B®. Examining this argument, we see that the size of the half snapshot 
overlap can be varied between 1 and 2^ — 1. 

— 1-D Leap-Frog solves a global problem, is iterative, converges regardless of 
the initial guess, works within the space of feasibile objects, has each step 
relatively small-scale, is amenable to the parallelism, and deals with nonlin- 
earities. 2-D Leap-Frog has these features also, except that in the present 
paper there is no need to deal with nonlinearities. 



References 

1. Benzi M., Nabben R., Szyld D. (2000) Algebraic theory of multiplicative Schwarz 
methods. Tech. Report 00210 Dep. Math., Temple Uni., Pliladelphia, USA, 
http: //www. math. temple . edu/~ szyld/. Numerische Mathematik. In press 354, 
363 

2. Buzbee B. L., Golub G. H., Nielson C. W. (1970) On direct methods for solving 
Poisson’s equations. SIAM J. Numer. Anal. 7, (4):627-656 353 

3. Frankot R. T. Chellappa R. (1988) A method of enforcing integrability in shape 
from shading algorithms. IEEE 10, (4):439-451 352, 353, 356 



364 Lyle Noakes and Ryszard Kozera 



4. Hackbush W. (1994) Iterative Solution of Large Sparse Systems of Equations. 
Springer, New York, Heidelberg, Berlin 363 

5. Horn, B. K. P. (1986) Robot Vision. McGraw-Hill, New York Cambridge, MA 352, 
353 

6. Horn B. K. P. (1990) Height and gradient from shading. Int. J. Comp. Vision 5, 
(l):37-75 352, 353, 356 

7. Horn B. K. P., Brooks M. J. (1989) Shape from Shading. MIT Press, Cambridge, 
MA 352, 353 

8. Kaya C. Y., Noakes L. (1998) A Leap-Frog Algorithm and optimal control: theo- 
retical aspects. In: Caccetta L., Teo K. L., Slew P. F., Leung Y. H., Jennings L. S., 
Rehbock V. (Eds) Proc. 2nd Int. Con. Optim. Tech. AppL, Perth, Australia, July 
1- July 3 1998. Curtin Uni. of Technology, 843-850 354 

9. Klette R., Schliins K. R., Koschan A. (1998) Computer Vision - Three Dimensional 
Data from Images. Springer, Singapore 353 

10. Kozera, R. (1991) Existence and uniqueness in photometric stereo. Appl. Math. 
Comput. 44 , (1):1-104 352, 353 

11. Kozera R. (1992) On shape recovery from two shading patterns. Int. J. Patt. Rec. 
Art. Int. 6, (4):673-698 352, 353 

12. Noakes L. (1999) A global algorithm for geodesics. J. Math. Australian Soc. Series 
A. 64 , 37-50 352, 354, 363 

13. Noakes L., Kozera R., Klette R. (1999) The Lawn-Mowing Algorithm for noisy 
gradient vector fields. In: Latecki L. J., Melter R. A., Mount D. M., Wu A. Y. 
(Eds) Proceedings of SPIE Conference, Vision Geometry VIII, Denver, USA, July 
19-July 20 1999. The International Society for Optical Engineering, 3811:305-316 
352, 353, 356, 362 

14. Noakes L., Kozera R. (1999) A 2-D Leap-Frog Algorithm for optimal surface recon- 
struction. In: Latecki L. J., Melter R. A., Mount D. M., Wu A. Y. (Eds) Proceedings 
of SPIE Conference, Vision Geometry VHI, Denver, USA, July 19-July 20 1999. 
The International Society for Optical Engineering, 3811:317-328 352 

15. Simchony T., Chellappa R., Shao M. (1990) Direct analytical methods for solv- 
ing Poisson equations in computer vision problems. IEEE Trans. Patttern Rec. 
Machine Intell. 12, (5):435-446 352, 353 



On Approximation of Jordan Surfaces in 3D 



Fridrich Sloboda and Bedrich Zatko 

Institute of Control Theory and Robotics, Slovak Academy of Sciences 
Dubravska 9, 842 37 Bratislava, Slovak Republic 
utrrzatkOsavba. sk 



Abstract. A polyhedral approximation of closed Jordan surfaces is de- 
scribed. The approximation is based on the notion of a relative convex 
hull in a polyhedrally bounded compact set obtained by gridding tech- 
nique and can be applied also to approximation of surfaces of functions. 



1 Introduction 

Measures are of importance in mathematics in two different ways. Measures can 
be used for estimating the size of sets, and measures can be used to define inte- 
grals. Borel, Lebesgue, Peano and Jordan associated to each set of points A an 
outer measure m*A and an inner measure mt:A. A set of points A is called 
measurable if m* A = m^A. Caratheodory [2] defined a p-dimensional mea- 
sure in a q-dimensional space for p, q integer. Hausdorff [11] generalized the 
Caratheodory’s p-dimensional integer measure for p real and so defined also frac- 
tal dimensions. For example the fractal dimension of the Cantor’s dust is • 
Hausdorff measures have played a dominant role in the development of surface 
area theory and, in particular, in the theory of minimal surfaces [4,5,7,11]. 

Let A be an arbitrary set of points in q-dimensional Euclidean space, g > 1. 
Let C/i, C/ 2 , • ■ • be a sequence of countable sets of points (closed or open) of the 
property 

a) ACUfeC/fc 

b) d{Uk) < e, fc = 1,2, . . ., where d{.) denotes the diameter of Uk- 
Let us denote 

OO 

inf^4^^=L(P^ p=l,2,..., 

for all Uk, k = 1 , 2 ,..., of the property a) and b), where 4^^ denotes the p- 
dimensional diameter of Uk- Then 

lim L^P^A = *L^P^A 

is called the outer p-dimensional Caratheodory measure. If p = 1 the measure 
is called linear, if p = 2 it represents the 2-dimensional area (surface area) of A. 
If Uk are balls with diameter e then the 2-dimensional diameter oi Uk equals 
l;7re^. In the next we will use the following notation 

*L^^^A = {A), 
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where H‘^{A) denotes the 2-dimensional Hausdorff measure of A, i.e., the surface 
area. 

The first topological definition of the surface area was given by Minkowski 
in 1901 [9]. Let S <Z B? he a, two-dimensional continuum homeomorphic to the 
unit sphere. The surface area of S, s{S), is according to H. Minkowski defined 
as 



s(S) = lim 
< 5^0 



V{Ms,s) 

2S 



provided that the limit exists, where V{.) denotes the volume of (.), and for 

(5 > 0, e S' 

= [J Us^y, 



Us,y := {x e I dist2(?/,x) < <5}. 



Since 1918 a number of other approaches to surface area definitions have 
been published. Unlike the curve length definitions, which are all equivalent, 
the surface area definitions are not equivalent [8]. They are equivalent only for 
a subclass of all surfaces, for example for sufficient smooth surfaces. The most 
general surface area definition is the Caratheodory-Hausdorff definition. 

The 2-dimensional Caratheodory outer measure defines the surface area but 
it does not allow to visualize the surface itself. The most efficient way how 
to visualize the surface itself is to find its polyhedral approximation. A two- 
dimensional continuum in homeomorphic to the unit sphere is called a closed 
Jordan surface, and a two-dimensional continuum in homeomorphic to the 
unit square is called a Jordan surface. An efficient polyhedral approximation of 
closed Jordan surfaces which with arbitrary accuracy approximates the surface 
area of a given closed Jordan surface is still an open problem. The first attempt 
to a polyhedral approximation of closed Jordan surfaces was made by C. Jordan 
by the end of the 19-th century. This approach was based on a triangulation 
method. But only few years later Peano and Schwarz showed that this approach 
is not functional even for a cylinder. 

In this paper a new approach to polyhedral approximation of closed Jordan 
surfaces is described. It is based on the notion of a relative convex hull related 
to a polyhedrally bounded compact set obtained by gridding. The notion of a 
relative convex hull represents a generalization of the notion of a convex hull. The 
relative convex hull related to a polyhedrally bounded compact set represents 
the smoothest polyhedral approximation of closed Jordan surfaces and can be 
applied also to approximation of surfaces of functions. It enables an efficient 
representation and visualization of surfaces and has applications in computer 
aided geometric design, computer graphics and image processing. 



2 Polyhedron 

Polyhedron in R^ is a simply connected compact set bounded by a surface home- 
omorphic to the unit sphere which is represented by union of convex faces. A face 
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is a simply connected compact set bounded by a simple closed convex polygonal 
curve and consists of vertices and edges. 

According to Euler for a polyhedron it holds 

v-e + f = 2, 

where v,e,f denotes the number of vertices, edges and faces, respectively. 

There are three types of vertices of a polyhedron: convex, concave and saddle 
vertices. Let v G he a, point and let tt C be a plane, such that v G tt. Let 
for (5 > 0 

M{v,tt,S) := {x G tt C \ dist2(x, i>) < <5}. 



Definition 1. A vertex v of a polyhedron Ps is called a convex vertex of Ps if 
there exists a plane tt and <5 > 0, such that 

M{v,tt,5) \ {-c} C °Ps, 

where denotes the complement of Ps ■ 

Definition 2. A vertex v of a polyhedron Ps is called a concave vertex of Ps if 
there exists a plane tt and (5 > 0, such that 

M{v,tt,S) \ {t;} C Pg, 

where Pg denotes the interior of Pg ■ 

Definition 3. A vertex v of a polyhedron Pg is called a saddle vertex of Pg if 
there does not exist a plane tt and ^ > 0, such that 

M{v,tt,6)\{v} C‘^Pg, or M{v,tt,S)\{v} C Pg. 

According to the Gauss-Bonnet theorem the global curvature of a surface 
homeomorphic to the unit sphere equals 47t. In the case of a polyhedron the 
curvature is related to the vertices of the polyhedron only. With each vertex of 
a polyhedron is associated an angle. This angle is defined as the sum of angles 
between neighbouring edges related to a vertex in radians. With each vertex of 
a polyhedron is associated a curvature which is defined as 2tt — a, where a is 
the angle related to the vertex. A convex vertex of a polyhedron not necessarily 
possesses a positive curvature and a polyhedron which possesses convex vertices 
only is not necessarily convex, see Fig. 1. 

For further considerations we are introducing the following 

Definition 4. Let e be an edge of a polyhedron Pg and let y he a point of e, 
y G e. We say that e is convex if there exists a straightline I : y G I and <5 > 0 
that 

M,iy,l,6)\{y}c^Pg, 

where 

Mi{y, I, S) := {x G I C R^ \ dist 2 {x, y) < <5} 
and ^Pg denotes the complement of Pg. 
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Fig. 1. A polyhedron which possesses convex vertices only and itself is not convex 



Similarly also a concave edge is defined. Now it holds that a polyhedron 
which consists of convex edges only is convex. 

In the following we will consider isothetic polyhedrons. Isothetic polyhedrons 
have the property that they possess only faces which are parallel with xy-, xz-, 
zy-plane, respectively. 

For an isothetic polyhedron it holds that its convex, concave vertices possess 
positive curvature which equals tt/2 and there are only two types of saddle 
vertices and these possess — 7t/2, — tt curvature, respectively. 

3 Relative Convex Hull 

In the following the notion of a relative convex hull will be introduced and its 
properties are summarized. 

Definition 5. Let U C i?" be an arbitrary set. A set C C U is said to be 
U-eonvex iff for every x,y G C sueh that xy (fU it holds that xy C C . 

A C U is t/-convex iff for every convex set C C U the set C n A is convex. 
Any convex A C U is [/-convex. If U is convex then A C U is [/-convex iff A 
is convex. U itself is [/-convex. If A C B C U and A is [/-convex then A is 
also B-convex. 

Let A C B C U C R^, and dA, dB, dU be homeomorphic to the unit sphere. 
Let A, B be [/-convex and H‘^{dU) < oo. Then 

H'^idA) < H‘^{dB) < 

If U = and H^{dB) < oo then A, B are convex and 

H'^{dA) < H'^idB) < oo, 
which follows from the theory of convexity [6,9]. 

Definition 6. Let V C U C i?" be given. The intersection of all U-convex sets 
containing V will be termed U-convex hull ofV and denoted by convuiV). 
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Fig. 2 . V,U C and CHu{V) 



For the relative convex hull convu(V) it holds the following: 

a.) V C convjj{V) C C/ n conv{V)^ 

h) V = convu{V) holds iff V is i7-convex, 

c) convjj{V) = conv{V) iff conv{V) C U . 

Let V C U C and dV,dU be homeomorphic to the unit sphere. Further 
let H‘^{dU) < oo. Then 

H^{dconvu{V)) = min ^ ^ {H^{dA)}, 

A t7-convex containing V 

which is the property of minimality of the relative convex hull. Because U C R^ 
itself is U -convex by definition and V <ZU it holds that 

H^{dconvu(V)) < H^{dU). 

Let V,U C R^ be polyhedrons such that V C U°. Then dconvu{V) is a 
polyhedral surface. The vertices of conv{V) belong to the vertices of convu(V). 
But not necessarily all vertices of convu{V) belong to the vertices of V, U, 
respectively. There might exist vertices of convu(V) which are lying on an edge 
of F, U, respectively. 

Let V,U C R? be polygons such that V C U° and let A C be a simply 
connected compact set. Then for convjj{V) it holds 

a) (dconvuiV)) = min H^ldA) 

dAcu\v°, VGA 

b) convjj{V) is a polygon, so called minimum perimeter polygon, 
whereby dconvuiV) = CHuiV) is called a stretched thread, see Fig. 2. 

According to a) dconvu{V) is the shortest Jordan curve in G := t/\F° encir- 
cling V. The properties of the shortest path in a polygonally bounded compact 
set are described in [12]. 
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4 Polyhedral Approximation 



In the following we will consider a polyhedral approximation of surfaces homeo- 
morphic to the unit sphere in R^. The approximation is based on the method of 
shrinking and on the notion of the relative convex hull. The method of shrinking 
could be also called a method of compression and it is related to a well-known 
property of nested set of continua [10]: 

Lemma 1: Let 

C1DC2D ...DC^D ... 



be a set of continua in R^. Then 



C = f|C'„ i = l,2,... 

i 

is a continuum, i.e., a connected compact set. 

Suppose that C = S C R^ is a two-dimensional continuum homeomorphic to 
the unit sphere with H'^{S) < 00 and 

Q = z=l,2,..., 

where P 2 *\ are polyhedrons such that C P 2 *^°. The approximation 
of the surface area of S homeomorphic to unit sphere in R^ is related to the 
following 

Theorem 1: Let M C R^ and let dM = S' be a two-dimensional continuum 
homeomorphic to the unit sphere with H^{S) < 00. Let 

00 

Go D Gi D G 2 3 . . . D G, D . . . D S, S = f| G„ 

where Gi := \ P^^\ P^^ are polyhedrons such that 

a)P«CPi*^°, i = 0,1,2,... 

h) lim sup P^(9P2*^) < cxD. 

i 

Then 

lim H^{CHpi,,{P^^)) = H\S), 

i — >oo ■‘2 

where CH = dconv p{i){p['‘'^). 

Comment 1: For the relative convex hull boundary GPp(t)(p|*^) it holds that 

^2 

a) P2(GP „(P«)) < H\S) or ij2(GP „(P«)) > H\S) 

^2 ^2 

b) < P2(GP „(P«) < limsupp2(aPi*^), 

^2 2 



On Approximation of Jordan Surfaces in 3D 



371 



(i) 

where denotes the minimal area surface homeomorphic to the unit sphere 
in Gi := P 2 ^ \ Pi'^° containing P^\ For it holds [3] 

I 

The relative convex hull boundary CHp(i) (Pj^^) can be called as stretched elastic 
surface. 

Comment 2: Theorem 1 for the two-dimensional case is proved in [13]. 

5 Method of Gridding 

The method of gridding enables to realize the above described general approach 
to surface area approximation of surfaces in B? , to represent and to visualize 
them. Regular grids represent a theoretical tool of set theoretic topology and 
mathematical analysis. The grid was introduced by C. Jordan and G. Peano 
by the end of the 19-the century in order to define measurable sets. In three 
dimensional space the most important grid is the orthogonal one. More formally, 
for p = 0, 1, 2, . . . , and for each triple {wi, W 2 , ws) of integer numbers let 

e I <x.< {w. + 1) 2-P, *=1,2, 3} 

represents the topological unit of an orthogonal grid. Let M C be a compact 
set bounded by a smooth closed Jordan surface S, dM = S. Let us denote 

~Mp := {Nf ,\Nf , C M°} 

^ ^ {Wi,W2.,W3) ' {Wi,W2,'W3) J 

P ^ {Wi,W2,W3) I {Wi,W2,W3) ' J 

Nf , elements which belong to ~M^ will be called inner elements of M 

and elements for which Nf , n dM ^ 0 will be called boundary elements 

of M, see Fig. 3. 

M is Jordan measurable if 

I{M) = inf = sup I(~Mp), 

p p 

where /(.) denotes the volume of (.). If M is Jordan measurable it does not 
imply that H‘^{dM) < 00. We say that ~^Mp, ~ Mp are related to a proper grid 
if for some po > 0 

a) ^Mp, ~ Mp for p = po,Po + 1, • ■ are simply connected compact sets 

b) for € +Mp \ -M°, p = po,po + 1, . ■ n 5 is a simple 

closed Jordan curve or a single point. 

~^Mp, ~ Mp related to a proper grid possess a proper polyhedral representation 
of their boundaries, i.e., ^Mp, ~ Mp with proper polyhedral representation are 
polyhedrons, so that they do not possess collinear vertices, see Fig. 4. 
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Fig. 3. Mp and ^Mp related to an M C 




Let us denote by 

Gp:=^Mp\~Mp, p = po,po + l,.... 

According to the definition of ~^Mp, ~ Mp, respectively, it holds 

a) ~Mp C +M° 
b) Gp A Gp+i A 5 

OO 

c) 5= n Gp. 

P=PO 

For S = dM C smooth H^{S) < oo and limp sup < oo, so that 

Theorem 1 applies. Fig. 7 shows ~Mp related to an M C R^, ~^Mp, CH{~ Mp) 
and CH+m^{~ Mp). Fig. 8 shows ~Mp related to an M C R^, GH{~Mp) and 
GH+M,i-Mp). 

Comment 3: If M C R^ is convex then 

a) dconv+M^ir Mp) = CH+m^{~ Mp) = CH{~ Mp) 
b) H\GH{-Mp)) < H\CH{-Mp+i)) 
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c) lim H^{CH{~Mp)) = H^{S), 

p — >-00 

for p = po,po + 1, . . which follows from the theory of convexity [6,9]. 

Comment 4: Let S : z = f{x,y), H^{S) < oo, where / > 0 is defined on a 
square D related to a regular orthogonal grid. Then ~ Mp related to a proper 
grid is a simply connected compact set, ~ Mp C ^Mp, and CH+m i~Mp) is a 
polyhedral surface homeomorphic to the unit sphere which approximates also 
the surface S : z = f{x, y) for (x, y) € D, see Fig. 9 and Fig. 22. There is an 
one to one map between a digital image and its 3D grid model. The gray-level 
related to a pixel represents the number of voxels in the corresponding column 
of its 3D model denoted by ~ Mp and vice versa, see Fig. 22. ~^Mp is obtained by 
bordering of ~ Mp by voxel elements so that ~ Mp C ~^Mp and the corresponding 
CH+^^(~Mp) represents also a polyhedral approximation and representation of 
the surface related to a digital image. 

6 A Method for the Relative Convex Hull Calculation 

Let ^Mp, ~ Mp C be polyhedrons defined by union of elements 

such that ~ Mp C ~^Mp. A set theoretic method for the relative convex hull 
conv+Mp{~Mp) calculation is defined as follows 

Step (1): Set Aq := ~ Mp and define 

Ak ■■= U fc = 1,2, . . . . 

x,yCAk^i, aPyC + Mp 



Step (2): The relative convex hull conv+Mp{ Mp) is defined as 

S 

conv+Mp{~ Mp) = (J Afc, 
k=0 



whereby Mp = Aq C Ai C A 2 C . . . C = Ag+i, i.e., the relative convex hull 
is defined by finite number of steps. 

7 Implicit Surfaces 

Implicit surfaces become interesting in computer aided geometric design and 
computer graphics. The regular orthogonal grids represent the most popular 
tools used by methods for representation and visualization of implicit surfaces [1] . 
Besides regular orthogonal grids also adaptive grids are used. A commonly used 
method for representation of implicit surfaces is based on so called polygonization 
techniques [1]. 

Let 



F{q) = 0, 
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be a smooth two-dimensional continuum homeomorphic to the unit sphere, the 
boundary of a compact set M d B?, dM = F{p) =0. A point q G for which 
sign F{q) = — 1, sign F{q)= -|-1 is called inner, outer point of M, dM = A(q) = 0, 
respectively. 

Suppose that for p = po,po -I- 1, . . ., ~^Mp, ~Mp related to M c R^, dM = 
F{q) = 0, defined in the previous section, are related to a proper grid. For the 
bounary elements 



for which n (F{q) = 0) is 

a) a simple closed Jordan curve which does not cut an edge of or 

b) a point which is not a vertex of 

it holds that all their vertices, Vi, i = 1,2, . . . ,8, have the same sign F{vi) yf 0, 
see Fig. 5. 

However, the relative convex hull boundary CF[+ f^^{~ Mp) cannot intersect 
these boundary elements, so that they need not be considered for 

the Mp) calculation. For all other boundary elemetns G 

^Mp \ ~ Mp it holds that not all their vertices, Vi, i = 1,2,..., 8, have the 
same sign F{vi), so that the set Gp := ~^Mp \ ~ M° in which the boundary 
of the relative convex hull lies is well defined. CFI+m Mp) calculation does 
not require numerical root finding to locate polygon vertices on edges of each 
boundary element as the polygonization methods do require [1]. 

In case of approximation of planar implicit curves homeomorphic to the unit 
circle on orthogonal grids the boundary of the relative convex hull is a poly- 
gonal curve whose vertices belong to convex, concave vertices of ~ Mp, ~^Mp, 
respectively, so that they belong to the grid points which are represented by 
integer coordinates only. The corresponding algorithm for CF[+j^ Mp) calcu- 
lation is based on integer arithmetic and has linear time complexity [12], see 
Fig. 6 which shows the approximation of F{x, y) = {x^ + y^)^ — 9 {x'^ + y'^) = 0 
by CH+mX ' Mp) related to 512x512 grid point resolution. 

In case of approximation of convex implicit surfaces on orthogonal grids the 
boundary of the relative convex hull CF[+m^{~ Mp) = CH{~Mp), so that it 



Fig. 5. Boundary elements whose all vertices Vi, i = 1,2, ... ,8, have the same 
sign F{vi) 
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is again represented by grid point vertices with integer coordinates only, see 
Fig. 10 - Fig. 17. The polygonization methods do require numerical root finding 
procedure whereby the resulting polyhedron is not necessarily convex [1]. 

Fig. 18 and Fig. 20 show ~Mp related to non-convex implicit surfaces and 
Fig. 19 and Fig. 21 show the corresponding CH{~Mp). Fig. 22 shows ~Mp 
related to a surface of a function and Fig. 23 shows the corresponding CH{~Mp). 

8 Conclusion 

A new approach to polyhedral approximation of closed Jordan surfaces is de- 
scribed. The approximation is based on the notion of a relative convex hull in 
a polyhedrally bounded compact set obtained by gridding technique. The rela- 
tive convex hull enables an efficient representation and visualization of surfaces 
and has significant applications in computer aided geometric design, computer 
graphics and image processing. 
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Fig. 7. -Mp, +Mp, CH{-Mp) and 
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Fig. 9. ■ A/p related to a surface of a functioiij CH{^ A/p) and C H-\- /V/p)^ 
whereby ^Mp is defined by bordering of ~ Mp by elements so that 

-Mp C +A/p° 
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Fig. 10. Mp related to F{x, y, z) = + y'^ + — 1 = 0 




Fig. 11. CH+m^{ Mp) = CH{ Mp) related to the previous figure, whereby 
is defined by bordering of ~ Mp by elements so that ~ Mp C 
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Fig. 12. Mp related to F{x, y,z)=x‘^ + y‘^ + z‘^ — l = 0 which corresponds to 
a higher grid point resolution 




Fig. 13. CH+Mp{~ Mp) = CH{~ Mp) related to the previous figure, whereby 
is defined by bordering of ~ Mp by elements so that ~ Mp C 

+M° 
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Fig. 14. Mp related to F{x,y,z) = (cr^ + + (z + 0.3)^)^ — 0.18(— — 5.0i/^ + 

z^)+ +o.3‘‘ - O.Sl-* = 0 




Fig. 15. CH+m^{ Mp) = CH{ Mp) related to the previous figure, whereby 
+Mp is defined by bordering of ~ Mp by elements so that ~ Mp C 



On Approximation of Jordan Surfaces in 3D 



383 




Fig. 16. Mp related to F(x, j/, z) = + 0.5 + 0.2 — 1 = 0 




Fig. 17. CH+Mp{~ Mp) = CH{~ Mp) related to the previous figure, whereby 
is defined by bordering of ~ Mp by elements so that ~ Mp C 

+M° 
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Fig. 18. Mp related to F{x,y,z) = + (z + 0.06)^)^ — 0.7688(— — 

y‘^ + z‘^)+ +0.624 - o.ea-* = 0 




Fig. 19. CH{ Mp) related to the previous figure, whereby is defined by 
bordering of ~ Mp by elements so that ~ Mp C ~^M° 
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Mp related to F{x, y, z) = {x^ + + z'^)^ — + y'^ + z'^) = Q 




Mp) related to the previous figure, whereby is defined by 
Mp by elements so that ~ Mp C +M° 
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Fig. 23. CH{ Mp) related to the previous figure, whereby is defined by 
bordering of ~Mp by elements so that ~ Mp C ~^M° 
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Abstract. We study the computation of rotation-invariant similarity 
measures of convex polyhedra, based on Minkowski’s theory of mixed 
volumes. To compute the similarity measure, a (mixed) volume func- 
tional has to be minimized over a number of critical orientations of these 
polyhedra. These critical orientations are those relative configurations 
where faces and edges of the two polyhedra are as much as possible par- 
allel. Two types of critical orientations exist for two polyhedra A and B. 
Type-1 critical orientations are those relative orientations where a face 
of B is parallel to a face of A, and an edge of B is parallel to a face 
of A, or vice versa. Type-2 critical orientations correspond to the case 
that three edges of A are parallel to three faces of 5, or vice versa. It 
has been conjectured that to perform minimization of the volume func- 
tional, it is sufficient to consider Type-1 critical orientations only. Here 
we present experimental proof showing this conjecture to be false. 



1 Introduction 

Shape comparison is one of the fundamental problems of machine vision. Shape 
similarity may be quantified by introducing a similarity measure. The require- 
ment of invariance under some set of shape transformations in general leads to 
complicated optimization problems. Therefore, one often studies shape classes 
and transformation sets for which a compromise between generality and effi- 
ciency can be found. 

Recently, a new approach to similarity measure computation of convex poly- 
hedra has been developed based on Minkowski addition [5,2]. These similarity 
measures are based upon the Minkowski inequality and its descendants, and 
the central operation is the minimization of (mixed) volume functionals. An 
attractive property of this family of similarity measures is that they are invari- 
ant under translations and possibly under scaling, rotation, and reflection. The 
method may be used in any-dimensional space, but we will concentrate on the 
3D case. 

For computing a rotation-invariant similarity measure of two convex polyhe- 
dra, a (mixed) volume functional has to be evaluated over a number of special 
relative orientations of these polyhedra, the so-called critical orientations. These 
critical orientations are those relative configurations where faces and edges of 
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the two polyhedra are parallel as much as possible. (Two faces are called paral- 
lel when they have the same outward normal.) Given two polyhedra A and B, 
the set of critical orientations can be divided in two classes, denoted by Type 1 
and 2, respectively. Type 1 occurs when a face of B is parallel to a face of A, 
and an edge of B is parallel to a face of A, or vice versa; Type 2 occurs when 
three edges of A are parallel to three faces of B, or vice versa. It was proved 
in [-5] that (i) for a given rotation axis, it is sufficient to compute the (mixed) 
volume functionals only for a finite number of critical angles, thus generalizing 
a result for the 2D case [2]; and (ii) the number of rotation axes to be checked 
is finite. The second result is trivial for Type-1 orientations, but the proof for 
Type 2 is more involved, and only establishes finiteness of the number of axes 
to be checked, without giving an explicit upper bound on the number of axes. 
Such an explicit upper bound for the number of Type-2 critical orientations was 
given in [1], where it was shown that the problem can be reduced to solving 
an algebraic equation of degree 8, which has to be solved numerically. So, given 
three edges of A and three faces of B, the number of critical orientation axes is 
at most 8. 

Experiments on mixed-volume minimization were reported by Tuzikov and 
Sheynin in [6]. Only Type-1 critical orientations were taken into account, and 
the authors conjectured that to find the global minimum of the mixed- volume 
functional it is sufficient to consider Type-1 critical orientations only. In this 
paper we reconsider this issue, and present experimental proof showing this con- 
jecture to be false. Experiments were carried out with randomly generated pairs 
of tetrahedra, and the minimum of the mixed volume functional was computed 
by taking into account either Type-1 or Type-2 critical orientations. As a re- 
sult, we have found that the minimum value of the mixed volume functional 
for Type-2 minimization can be larger as well as smaller than that of Type-1 
minimization. 

The paper is organized in the following way. In Section 2 we define Minkowski 
addition of convex polyhedra and their slope diagram representation, and intro- 
duce a rotation-invariant similarity measure based on inequalities for the (mixed) 
volume. In Section 3 similarity measure computation by minimization of a mixed- 
volume functional is considered, and the main results from the literature are 
summarized. In Section 4 we give experimental results on minimization of the 
mixed- volume functional, by taking into account Type-1 and Type-2 critical 
orientations, respectively. Conclusions are summarized in Section 5. 

2 Preliminaries 

In this section the Minkowski sum, mixed volumes, a similarity measure based on 
the Minkowski like inequality, and the slope diagram representation of convex 
polyhedra are introduced. The compact convex subsets of are denoted by 
C = C(R^). Two shapes A and B are said to be equivalent if they differ only by 
translation; we denote this as A = B. 
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2.1 Minkowski Sum and Mixed Volumes 

The Minkowski sum of two sets A, B C is defined as 

A (B B = + b\ci G A^ b G BJ. (1) 

It is well known [4] that every convex set A is uniquely determined by its 
support function, given by: 

m) = sup{(a, u) I a € A}, uGS'^. 

Here (a, u) is the inner product of vectors a and u, and denotes the unit 
sphere in Also [4]: 

h{A(B B,u) = h{A,u) + h{B,u), u G , (2) 

for A,BgC. 

Denote by V (A) the volume of the set A C Given convex sets A, B CM.^ 
and a, (3 >0, the following holds: 

V{aA © pB) = a^V{A) + Sa^pV{A, A, B) + 3a/3V(A, B, B) + P^V{B). (3) 

Here V{A,A,B) and V{A,B,B) are called mixed volumes. 

The Minkowski inequality for convex sets A, B G C(R^) reads [4] 

V{A, A, Bf > V{AfV{B), (4) 

where equality holds if and only if i? = A A for some A > 0. 



2.2 Similarity Measure 



Using the Minkowski inequality (4), a similarity measure cr may be defined as 
follows: 



cr(A, B) = sup 



U(H)3U(A)3 



= sup 



U(H)3U(A)3 



rSiV{B,B,R{A)) n^lkV{A,R{B),R{B)) 



( 5 ) 



where TZ denotes the set of all spatial rotations, and where R{B) denotes a rota- 
tion of i? by i? G 7?.. The second equality follows from the fact that V {B, B, R{A)) 
= V{R{A),B,B) = V {A, R~^ {B) , R~^ {B)) . Obviously, 0 < a{A,B) < 1, where 
(t(A, B) = 1 when B = \R{A) for some rotation R and some A > 0. The simi- 
larity measure cr is invariant under rotations and scalar multiplications. It is not 
symmetric in its arguments. Symmetric versions may be defined in various ways; 
an example is the measure cr'(A, B) = i(cr(A, B) + a{B, A)). 

To find the maximum in (5), the mixed volume V {A, R{B) , R{B)) has to be 
minimized over all orientations of A. 

If i? is a convex polyhedron with faces Fi and corresponding outward unit 
normal vectors Ui, i = 1, . . . , k, then [4] 



U(A, B, B) = V{B, A) = i ^ h{A, Ui)S{FP, 



(6) 



where S{Fi) is the area of the face Fi of B and h{A,Ui) is the value of the 
support function of A for the normal vector Ui. 
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2.3 Slope Diagram Representation 

Denote face i of polyhedron A by Fi{A), edge j by Ej{A), and vertex k by Vk{A). 
The slope diagram representation (SDR) of polyhedron A, denoted by SDR(A), 
is a unit sphere covered with spherical polygons. A vertex of A is represented 
by the interior of a polygon on SDR(A), an edge by a spherical arc on SDR(A), 
and a face by a vertex of some polygon on SDR(A). To be more precise: 

— Face representation. Fi{A) is represented on the sphere by a point 
SDR(Ti(A)), located at the intersection of the outward unit normal vec- 
tor Ui on Fi{A) with the unit sphere. 

— Edge representation. An edge Ej(A) is represented by the arc of the great 
circle connecting the two points corresponding to the two adjacent faces 
of A, (A). 

— Vertex representation. A vertex 14 (A) is represented by the interior of the 
polygon bounded by the arcs corresponding to the edges of A meeting 
at 14(A). 

In Fig. 1 an example of a polyhedron and its SDR is given. 

It is easily verified that the faces Fi(A) and Fj{B) are parallel (that is, 
have the same outward normal) when SDR(Fi(A)) coincides with SDK{Ej{B)). 
Also, an edge Ei{A) is parallel to Fj{B) when SDR(Fj(i?)) lies on SDR(Ai(A)). 
Therefore, the maximum in (5) is obtained when points of SDR(A) coincide with 
points or edges of SDR(R). 

3 Similarity Measure Computation 

In this section, we consider the problem of computing the similarity measure (5). 

Let i be an axis passing through the coordinate origin and r^^a be the rotation 
in about I by an angle a in a counter-clockwise direction. The problem to be 
considered is the minimization of the functional V{A,re^a{B),re^a{B)). Given a 
fixed axis i and angle a, (6) can be used to compute this functional. 

While rotating the slope diagram of polyhedron A, situations arise when 
spherical points of the rotated SDR of A intersect spherical arcs or points of the 
SDR of B. Such relative configurations of A w.r.t. B are critical in the sense 
that they are candidates for (local) minima of the objective functional to be 
minimized. For more precise definitions we refer to [5]. 

3.1 Fixed Rotation Axis 

Let be a fixed rotation axis. The ^-critical angles of B with respect to A for 
mixed volume V{A,re^a{B) , re.a{B)) are the angles {Oj}, 0 < < a '2 < ■ . ■ < 

< 27t, for which spherical points of the rotated slope diagram SDR(r^^„/ {B)) 
intersect spherical points or arcs of SDR(A). The following result was proved 
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Fig. 1. (a): A tetrahedron with unit normal vectors on its faces, (b): Its slope 
diagram representation 



Proposition 1. Given an axis of rotation i, the mixed volume of the con- 
vex polyhedra A and B, i.e. V{A^ri^aiB),ri^ciiB)), is a function of a which 
is piecewise concave on [0,27t), i.e., concave on every interval (cr^, cr^+i); for 
k = 1,2, . . . , N and 

This result implies that in order to minimize the mixed volume for any fixed 
rotation axis £, it is sufficient to compute it for all ^-critical angles (which are 
clearly finite in number), and take the minimum of the values thus obtained. 

3.2 Minimization over All Rotation Axes 

An extensive analysis in [5] showed that two types of critical orientations 
have to be considered for obtaining the global minimum of the mixed volume 
V{A,R{B),R{B))-. 

Type 1 A face of A is parallel to a face of B, and an edge of A is parallel to a 
face of B. 

Type 2 Three edges of A are parallel to three faces of B. 

To find the orientations of Type 1 is trivial. When a face Fj{B) is parallel to 
a face Fi{A), B has only one degree of freedom left, being a rotation around 
an axis through the origin and the spherical point SDK{Fj{B)). Using the slope 
diagram representations of A and B, it is easy to find those rotations of B around 
this axis that make the slope diagram representations of faces of B coincide with 
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the slope diagram representations of edges of A. The problem can be restated 
as solving a quadratic equation in one variable [6] . 

To find the orientations of Type 2 means looking for those orientations of B 
where three points on SDR(iJ) (representing three faces of B) lie on three spher- 
ical arcs of SDR(A). (Notice that no more than three points have to be checked, 
since a rotation is uniquely determined by three parameters.) This problem can 
be reformulated as follows: given two triples of 3D vectors a,b,c and k,l ,m, 
find the rotation R that transforms the vectors a,b,c such that R{a) is per- 
pendicular to k, R{b) is perpendicular to I, R{c) is perpendicular to m. That 
is, the following system of equations has to be solved for R: 

{k,R{a)) = 0 
{l,R{b)) = 0 
{m,R{c)) = 0 

Using the computer-algebra program MAPLE®, this system was reduced in [1] 
to the solution of an algebraic equation in one variable of degree 8, whose coeffi- 
cients are lengthy expressions in the elements of the vectors a,b,c and k,l ,m. 
This equation can be solved numerically using Laguerre’s method [3]. 

In [6] the minimization of the mixed volume was carried out by taking into 
account Type-1 critical orientations only. The authors conjectured that this is 
actually sufficient to find the global minimum. 

In the next section we report on a number of experiments we did in order to 
verify this conjecture. 

4 Experimental Results 

In this section, we give results for minimization of the mixed volume 
V {A, R{B) , R{B)) of convex polyhedra A and B when R runs over the set of 
all spatial rotations. Both polyhedra were chosen to be tetrahedra, whose edge 
sizes varied randomly. For each pair A and B, we performed minimization in 
two ways, depending on the set of rotations taken into account: 

Type-1 minimization All critical rotations are of Type 1: a face of A is parallel 
to a face of B, and an edge of A is parallel to a face of B. 

Type-2 minimization All critical rotations are of Type 2: three edges of A 
are parallel to three faces of B. 

To verify the conjecture, we checked for each pair of tetrahedra A and B whether 
the result for Type-2 minimization was larger than that of Type-1 minimization. 



Remark In fact, in our implementation we use two routines, one which performs 
Type-1 minimization, and another one which minimizes over the combined set 
of both Type-1 and Type-2 critical rotations. If we find that the minimum in 
the second case is smaller than that in the first case, then we know that the 
conjecture is false. Also, instead of fixing A and rotating B, we may as well fix B 
and rotate A in view of the identity V {A, R(B) , R(B)) = V{B, B, R~^(A)). 
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In the experiments, we found cases among the randomly generated pairs of tetra- 
hedra for which the result for Type-1 minimization was actually larger than that 
of Type-2 minimization, although the differences were often small. An example 
where the difference is substantial is shown in in Fig. 2. In this case the mixed 
volume for Type-1 minimization equals 1.81213e-|-006, and that for Type-2 mini- 
mization equals 1.59156e-|-006, which is significantly smaller. The corresponding 
tetrahedra are shown in Fig. 2. 

It is interesting to look at the Minkowski sum of i? 0 R*{A), with R* the 
rotation which realizes the minimum of the mixed volume for Type-1 and Type- 
2 minimization, respectively, see Fig. 3. The corresponding slope diagrams are 
shown in Fig. 4, with the spherical arcs of A shown in bold. From the pictures, one 
can verify that indeed for Type-1 minimization, a spherical point of B coincides 
with a spherical point of A, and another spherical point of i? is on a spherical 
arc of A, whereas for Type-2 minimization, three spherical points of B are on 
spherical arcs of A. 

5 Conclusion 

We have studied the computation of a rotation-invariant similarity measure of 
convex polyhedra A and B, involving the minimization of a mixed- volume func- 
tional V {A, R{B) , R{B)) with R running over the set of critical rotations. Two 
types of critical orientations were distinguished: for Type-1 critical orientations 
a face of A is parallel to a face of B, and an edge of A is parallel to a face of B; 
for Type-2 critical orientations three edges of A are parallel to three faces of B. 
We performed experiments with randomly generated tetrahedra, and computed 
the minimum of the volume functional by taking into account either Type-1 or 
Type-2 critical orientations. We found that the result for Type-2 minimization 
can be larger as well as smaller than that of Type-1 minimization. Therefore, 
in contrast to what has been conjectured in [6], one has in general to take both 
Type-1 and Type-2 critical orientations into account to compute the global min- 
imum of the mixed volume. 
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Slope diagram of A Slope diagram of B 

Fig. 2. Polyhedra A and B as used in the experiment, for which V {B, B, R 2 {A)), 
with i ?2 the rotation which realizes the minimum of Type 2, is smaller than 
V{B, B, with the rotation which realizes the minimum of Type 1 
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B®Rl{A) 




m{A) 




B®m{A) 



Fig. 3. Top row: polyhedron A in the rotated configuration which minimizes 
mixed volume according to Type 1 (i?J(A)) and Type 2 (i? 2 (^))- Bottom row: 
Minkowski sums of polyhedron B and rotated polyhedron Rl{A), c.q. R^iA) 
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SDR(B © R 2 {Aj): front view SDR(B © R^iAj): back view 

Fig. 4. Top row: two views of the slope diagram of the Minkowski sum of poly- 
hedron B and rotated polyhedron Bottom row: two views of the slope 

diagram of the Minkowski sum of polyhedron B and rotated polyhedron (^) • 
Here and i ?2 are the rotations which realize the minimum of Type 1 and 
Type 2, respectively. Bold curve segments indicate spherical arcs of polyhedron A 
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Abstract. We describe an approach to compute the surface skeleton 
of a 3D object using distance information. It is based on iterative thin- 
ning of the distance transform of the object. The surface skeleton of an 
object should be topologically equivalent to the original object, centred 
within the object, a thin subset of the object, and such that the origi- 
nal object can be recovered from the surface skeleton. We emphasize the 
last property, i.e., the importance of having a reversible surface skeleton, 
and present a general framework for distance based skeletonization al- 
gorithms. Resulting surface skeletons for the D® and the distance 
cases are shown. 



1 Introduction 

The surface skeleton of a 3D object provides a representation of the object which 
is simpler and contains a smaller amount of data than the original object, without 
loss of the information necessary to perform shape analysis. This representation 
can be used directly for shape analysis or can be further reduced to a curve 
skeleton. An overview of surface skeletonization algorithms closely related to the 
algorithms presented here is given in Section 4. 

To be a useful representation, a surface skeleton should satisfy some prop- 
erties. It should be topologically equivalent to the original object, centred within 
the object, and a thin subset of the object. Another important property, not 
often considered among existing skeletonization algorithms for 3D objects, is 
that the surface skeleton should be reversible, i.e., that the original object can 
be recovered from the surface skeleton. This can be achieved by using distance 
information. If each voxel in the surface skeleton is labelled with the distance 
to its closest voxel in the original background, according to the chosen distance, 
and the set of centres of maximal balls is included in the surface skeleton, then 
the original object can be recovered, e.g., by applying the reverse distance trans- 
formation. The set of centres of maximal balls can easily be detected on the 
distance transform. By including this set, we also guarantee that the surface 
skeleton will be centred within the original object with respect to the chosen 
distance. 

The surface skeleton can be detected in several different ways. Most often, 
iterative thinning of the object is used. In this case, the border of the object is 
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iteratively removed, keeping voxels necessary for topology and shape preserva- 
tion, until the surface skeleton remains. This approach can be combined with 
distance information, i.e., the distance transform is computed and then iterated 
removal is directly performed distance layer after distance layer. In fact, on the 
distance transform, we have an easy identification of the border, since the suc- 
cessive borders are proper subsets of the successive layers. If we use the set of 
centres of maximal balls as anchor-points, i.e., non-removable voxels, during the 
thinning, a reversible surface skeleton is obtained. 

We start from previous work, [14,16], and present a novel way of detecting 
a reversible surface skeleton using iterative thinning of the distance transform 
of the object. Surface skeletons centred within the original object with respect 
to two different distances, and are considered (Section 3). The algo- 
rithms are presented in a general way, aimed towards a skeletonization algorithm 
which can, with small changes, be computed by iterative thinning of distance 
transforms based on any distance. With respect to the algorithms introduced 
in [14,16], the algorithms presented here have a considerably smaller computa- 
tional cost; they are framed into a common scheme which allows the user to 
select the preferred distance directly when the input object is provided; and the 
most important difference is that the algorithms better preserve shape informa- 
tion, which is needed if the skeleton is used for applications. A brief presentation 
of other skeletonization algorithms can be found in Section 4 and a discussion 
about future work in Section 5. 

2 Definitions 

The images are, in our case, 3D images and consist of object and background, 
i.e., are binary volume images. Each voxel can have three types of neighbours: 
face, edge, and point neighbours. We assume that the object is 26-connected 
and, thus, the background is 6-connected. 

The algorithms presented in this paper are based on the use of distance infor- 
mation from the background to voxels in the object. The distance is computed 
by propagating distance information locally and the distance increases from the 
border towards the inside of the object. This is done in one forward and one 
backward scan over the image. The result is a distance transform (DT) in which 
each voxel in the object is labelled with the distance to its closest voxel in the 
background. The labels in the DT depend on the chosen distance. Here, we will 
use the simple distances D® and for which the distance between two voxels 
is equal to the number of permitted steps in a minimal path between the two 
voxels. For D®, only steps in face directions are possible, while for steps in 
face, edge, and point directions are possible. We will denote the corresponding 
distance transforms DT® and DT^®, respectively. Information about DTs for 
3D images and on distances providing better approximations to the Euclidean 
distance can be found in [3]. 

In a similar way as for the DT, we can compute the reverse DT. Also in this 
case, the distance is computed by propagating distance information locally but 
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the distance decreases starting from a number of voxels with distance labels. 
This is done in one forward and one backward scan. 

Each distance label in the DT of an object can be interpreted as the radius of 
a ball (in the underlying distance) fully enclosed in the object. A voxel is said to 
be a centre of a maximal ball (CMB) if its corresponding ball is not completely 
covered by any other ball. The union of the maximal balls coincides with the 
object. 

The set of CMBs can for and be detected as voxels having no neigh- 
bouring voxels with higher label. Again, for D^, only face neighbours are consid- 
ered. An object can be represented by its set of CMBs, since the object can be 
recovered from the set of CMBs by applying the reverse distance transformation. 

A voxel u is a simple voxel if its removal does not alter the topology, i.e., the 
removal of v does not change the number of object components, the number of 
cavities, or the number of tunnels in a 3 x 3 x 3 set of voxels centred on u, [2,9]. 
To verify whether the number of object components changes, the number of 26- 
connected components in the 26-neighbourhood of v, which consists of all 

face, edge, and point neighbours of v, can be counted. Object voxels with yf 
1 should be ascribed to the skeleton. To preserve the number of cavities and 
tunnels, a subset of the 26-neighbourhood is used, which consists of all face and 

edge neighbours of v, the 18-neighbourhood of v. The number of 6-connected 



background components in the 18-neighbourhood that are 6-adjacent to v {N j: ) 
is counted. Voxels with iVy* yf 1 should be ascribed to the skeleton. 

Note that since a simple voxel is defined by A^^® = 1 and A^y* = 1, a simple 
voxel has a face neighbour in the background. In fact, a voxel with no face 
neighbours in the background has A^y =0. The set of voxels with N j: >0 are 
denoted as the border of the object. 

As a running example for this paper, we will use a Euclidean ball. This might 
seem to be a trivial object, but this is not the case since we use distances (T>® 
and H^®) different from the Euclidean distance. The object together with its 
CMBs for HT® and HT^® are shown in Fig. 1. We remind that the H® and 
the I?^® balls are shaped as octahedra and boxes, respectively, see Fig. 2. 

3 Skeletonization by Iterative Thinning of the Distance 
Transform 

To compute the skeleton of a 2D planar pattern, it is possible to use an algo- 
rithm based on iterative thinning guided by distance information. The set of 
centres of maximal discs together with pixels needed for preserving the topol- 
ogy are kept during the thinning, see, e.g., [15]. If we generalize the algo- 
rithm for 2D objects to deal with 3D objects, a possible algorithm would be: 
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Fig. 1. The running example, a Euclidean ball with radius 28 voxels, with its 
CMBs for DT^ and 




Fig. 2. The ball with radius 28 voxels, left, and the ball with the same 
radius, right 



1. Compute the DT of the object. 

2. Detect the set of centres of maximal balls on the DT. 

3. For each voxel with distance label k and a face neighbour in the background: 

(a) mark simultaneously the simple voxels that are not CMBs. 

(b) remove sequentially marked voxels that are still simple when they are 
visited. 

Step 3 might need several iterations, depending on the underlying distance, until 
every voxel with distance label k is either a skeletal voxel or is removed. This is 
because the voxels with distance label k do not necessarily have a face neighbour 
with label fc — 1 on the DT, unless D® is used. Hence, some voxels with distance 
label k can not be considered for removal directly after the voxels with label 
k — 1 have been processed. Those can be considered only after a face neighbour 
with distance label k has been removed. 

When applying the skeletonization algorithm described above to our running 
example, we obtain the D® and the skeletons shown in Fig. 3. They are topo- 
logically equivalent to the original object, centred within the object with respect 
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to the underlying distance, and fully reversible. However, visually they are not 
what would be considered as good surface representations of the original object, 
but are rather “combs” of curves. For shape analysis purposes, this is not a con- 
venient representation. We aim to have a surface representation centred within 
the object instead of a set including “combs” of curves. Thus, for 3D objects, 
skeletonization is more complicated, since also the gaps between the curves need 
to be filled, or rather not to be created. What is needed is a modification of the 
above algorithm to add a suitable condition for surface preservation. In detail. 
Step 3 becomes: 

3.^ For each voxel with distance label k and a face neighbour in the background: 

(a) mark simultaneously the simple voxels that are not CMBs. 

(b) remove sequentially marked voxels that are still simple and satisfy 
ConditioUrm- 

The condition for removal, ConditioUrm, differs for different distances. 
ConditioUrm together with the resulting surface skeletons extracted from DT^ 
and will be described in the following Sections 3.1 and 3.2. 



3.1 Surface Skeletons 



In an algorithm for detecting a surface skeleton of a 3D object based on iterative 
thinning of DT®, we use the following condition (see Fig. 4(a)) 

Condition^ There exists a pair of opposite face neighbours of a voxel v such 
that one is a voxel not included in the skeleton being a background voxel 
and the other is an internal voxel, i.e., a voxel with higher distance label. 



This is equivalent to assigning a voxel to the skeleton if it has no pair of opposite 
face neighbours such that one has lower distance label without being included 
in the skeleton and the other has higher distance label. Condition^^ prevents 
removal of CMBs, since voxels having only face neighbours with lower or equal 
distance labels, i.e., CMBs, do not satisfy Conditionals and, hence, are not re- 
moved. Besides the CMBs also voxels in saddle configurations are prevented from 




Fig. 3. and skeletons topologically correct but not preserving surfaces 
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Fig. 4. Voxels involved in the Conditionsrm used during voxel removal/marking 
for (a) and (b) surface skeletonization. White, grey, and black denote 
background voxels, border voxels, and internal voxels, respectively 



removal. These are voxels with pairs of opposite face neighbours both with labels 
higher or equal, or lower or equal than the voxel itself. Moreover, some voxels 

needed for topology preservation are prevented from removal by Conditionf^j^, 



without computing the more computationally heavy N ^ . 

We start with computing DT^ of the object. Although CMBs would be au- 
tomatically kept, as they do not satisfy Condition^^, we prefer to identify them 
separately. This will allow us to have within the same framework both this al- 
gorithm and the one based on but for which a similar condition, meaning 
also keeping CMBs, does not hold. On I?T®, the CMBs are detected. The labels 
in DT^ can be interpreted as iteration numbers for the iterative thinning. Only 
border voxels need to be considered for removal. During each iteration, the bor- 
der consists of all voxels having the current distance label together with voxels 
already detected as non-removable, i.e., with skeletal voxels with lower distance 
labels. 

Each iteration is done in two scans. During the first scan, simple voxels 
that are not CMBs are marked. This is done simultaneously for all voxels in 
the border. During the second scan, marked voxels, still being simple voxels 
and fulfilling Condition^^ are removed. This is done sequentially to prevent 
altering the topology. When all successive borders have been processed, all non- 
skeletal voxels have been removed and the voxels constituting the surface skeleton 
remain. The resulting D® surface skeleton of the running example is shown in 
Fig. 5, left. 



3.2 Surface Skeletons 

For the iterative thinning is slightly more complicated. The voxels with 

distance label k are not guaranteed to have a face neighbour with distance label 
k — 1 since the distance is used. This implies the use of more than two scans 
in each iteration. In fact. Step 3' has to be repeated three times to give all voxels 
with distance label k the possibility either to become a skeletal voxel or to be 
removed. Moreover, Condition^^ is not suitable, since it is specifically tailored 
for DT®. 

A suitable Condition^^ for a voxel v is (see Fig. 4(b)) 
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Fig. 5. and surface skeletons 



Condition^" No pair of opposite face neighbours of v exists where the voxels 
in the pair are either internal voxels or border voxels that are not marked 
for removal. 

This is equivalent to saying that v is prevented from being removed if it has a 
pair of opposite face neighbours each of which has either higher distance label, 
or the same distance label without being marked for removal. 

Also in this case we start by computing the DT, of the object. On 

DT^^, the CMBs are detected. For each iteration k in the thinning process, only 
voxels with distance label k and a face neighbour in the background are consid- 
ered for removal. In the same way as for detecting the D® surface skeleton, we 
first mark suitable voxels simultaneously and, among the marked voxels, remove 
voxels sequentially. During the marking step, simple voxels that are not CMBs 
are marked. During the removal step, marked voxels, still being simple voxels 
and fulfilling Condition^^ , are removed. The resulting surface skeleton of 
the running example is shown in Fig. 5, right. 



4 Related Work 

The number of papers dealing with reversible skeletonization for 3D objects or at 
least with skeletonization guided by the DT is increasing. This is not surprising 
since the number of applications where 3D images are used is increasing. Skele- 
tonization is an efficient way of reducing the amount of data that needs to be 
analyzed. This is especially important for 3D images since the amount of data is 
large. Moreover, by reversible skeletonization, (little or) no information is lost. 
Most papers on reversible skeletonization published so far deal with iterative 
thinning in some way guided by the DT. 

In [4] , Borgefors et al. presented an algorithm for iterative thinning where the 
border for each iteration is detected by identification of voxels having N j: = 1. 
This can easily be changed to be guided by DT^, so obtaining a fully reversible, 
distance guided skeletonization algorithm. However, a number of unnecessary 
voxels are added on the surface compared to the D® surface skeleton presented 
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in this paper, especially on the edge or in the regions close to the edge of the sur- 
faces in the surface skeleton. The unnecessary voxels added on the surface cause 
that the analysis of the skeleton becomes more difficult, e.g., the curve skeleton 
computed from the surface skeleton will have more (unnecessary) branches. 

An algorithm for which the iterative thinning is distance guided was pre- 
sented by Pudney in [7]. There the weighted (3,4,5) DT for which the distance 
is given by a minimal path where the weight for a step in face direction is 3, 
in edge direction 4, and in point direction 5, [3], is used. Two alternatives to 
skeletonization algorithms are given. One of these includes the set of CMBs in 
the skeleton. It is, however, not clear whether the skeleton is a surface skeleton 
or “combs” of curves. The latter is most possibly the case. Even though nothing 
is mentioned about reversibility in [7], the original object can be recovered from 
the skeleton, since it includes the set of CMBs. 

In 1995, Toriwaki and Saito introduced an algorithm where thinning is guided 
by the Euclidean DT of the object to be able to have a rotation invariant re- 
sult [11]. The result is a skeleton which is topologically equivalent to the original 
object but produces a lot of unnecessary branches. This has recently been com- 
bined with the approach introduced for 2D images by Ragnemalm in 1993, [8], 
to have a less branchy skeleton. The resulting algorithm is called anchor-point 
thinning and was presented during the winterschool “Digital and Image Geome- 
try”, [10,1]. It was among the last works by the late Dr. Saito. The algorithm is 
computationally heavy, but produces good and stable results, and aimed to be 
a tool for specific applications, e.g., virtual endoscopy systems, where the object 
is a structure with cylindrical shape. 

Another algorithm for which the Euclidean DT is used was presented by 
Malandain and Fernandez-Vidal in [6]. Two “skeletons” are combined, one giving 
the desired level of detail, i.e., giving a more robust skeleton, and the other 
preserving the topology of the original object. The final skeleton is obtained by 
topological reconstruction of the skeleton giving the desired level of detail from 
the topologically correct skeleton and allow almost full recovery of the original 
object (depending on the chosen level of details). 

The idea of iterative thinning of a DT to preserve topology has been used also 
by Bouix and Siddiqi in [5]. They combine previous work, [13], not guaranteed 
to preserve topology with iterative thinning guided by the Euclidean DT. The 
resulting surface skeletons allow almost full recovery and are rotation invariant. 

In [12], a fully reversible skeletonization algorithm resulting in a D® surface 
skeleton was presented, which does not use iterative thinning of the DT. Instead 
skeletal voxels are detected directly on DT® in a number of scans, generally 
smaller than that required by iterative thinning of the DT, especially for thick 
objects. The number of scans is independent of the thickness of the object, which 
is not the case when iterative thinning is used. 
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5 Discussion 

In this paper, we have presented a general framework for fully reversible surface 
skeletonization. The algorithm is simple and reconstruction of the original object 
is possible in an easy way, e.g., by using the reverse distance transformation. We 
have presented algorithms for the and the distances. Work still remains 
to obtain a fully reversible surface skeletonization algorithm that can be based 
on any distance. 

For the algorithms presented here, some minor improvements can be done. 
ConditioUrm used during the removal part of the iterative thinning is not op- 
timal, as can be seen in Fig. 5. Still some problems remain when building the 
surfaces, resulting in rather jagged border of the surface, especially for the 
surface skeleton. 

Surface skeletons for a number of different objects are shown in Fig. 6. All 
objects are rather small (64 x 64 x 64 images) and created by merging different 
simple geometrical shapes such as balls and cylinders. The intention is to be able 
to show how the algorithms perform in detail. In Fig. 7, a slightly more complex 
and larger object is shown together with its D® and D^® surface skeletons. 

We are aware that T>® and I?^® are rough approximations of the Euclidean 
distance. Hence, the surface skeletons are not rotation invariant. Future work will 
be to find similar algorithms based on, e.g., weighted distances such as (3,4,5). 
Such algorithms would result in surface skeletons that are almost rotation in- 
variant. 

When the Euclidean DT is used, [6,5,10], the computational cost is higher 
than if a weighted DT, is used. It is also more computationally heavy to guarantee 
full reversibility and to actually reconstruct the original object. 

Note that as long as iterative thinning is used, the number of iterations 
needed is proportional to the thickness of the original object. By using an ap- 
proach similar to the one introduced in [12], this could be avoided. More in- 
vestigation should be done to develop such algorithms for other distances than 
DT®. One step in that direction is to first develop iterative algorithms for other 
distances and then use the experiences from that work. 
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Abstract. In this article, we present the Euclidean distance transforma- 
tion (EDT) of three-dimensional (3D) binary pictures and applications, 
with stress on Dr. Toyofumi Saito’s work in collaboration with one of the 
authors. First, EDT, skeletons, and reverse EDT are defined, and then 
the algorithms to perform them are presented. Next, a sequential-type 
3D thinning algorithm that preserves the topology of an input object 
is introduced. This algorithm is derived by combining an ordinary two- 
dimensional thinning algorithm with EDT. Finally, applications to med- 
ical image processing are presented such as automated path Ending in 
virtual endoscopy, as well as analysis of 3D pathological sample images. 
Dr. Toyofumi Saito, an associate professor who was one of the most 
active researchers in the field of image processing in Japan and a young 
leader of the author’s laboratory, and who was initially scheduled to be the 
first author of this paper, passed away on 26 October 2000. We have lost 
a most reliable and most promising colleague, an experienced supervisor, 
and a very sincere friend. He is deeply missed by all in his family, his 
friends, and his colleagues. On behalf of the laboratory the authors wish to 
dedicate this short note in memory of Dr. Toyofumi Saito and to express 
our sincere condolences to his loved ones. 



1 Introduction 

Distance transformation (DT) is the transformation of a binary picture where the 
value of a 1-pixel is replaced by the distance to the nearest 0-pixel. DT is a very 
useful tool in binary image processing for shape feature extraction and thinning. 
Various studies have been reported concerning algorithms and applications of 
distance transformation since the early years of image processing [16,25]. 

From the viewpoint of parallel type and sequential type algorithms, the main 
issues are extension to 3D images and consideration on different distance metrics. 
Saito’s most important contributions to these topics were in studies on DT using 
a Euclidean metric and the applications of DT to thinning for two- and three- 
dimensional picture. 
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Fig. 1. Three-dimensional (3D) digital picture and 3D distance picture. The 
numbers in the right object show 26-neighbour distances 



In this short note, we introduce an outline of results concerning Euclidean 
distance transformation (EDT), obtained by Dr. Toyofumi Saito in collaboration 
with J. Toriwaki of Nagoya University in Japan. First, we introduce an efficient 
EDT algorithm, which gives strict Euclidean distances. Second, we define what 
a skeleton is and present the reverse EDT briefly. Thirdly, we discuss a few 
applications including an improved thinning algorithm for 3D objects. The last 
topic is about a new thinning algorithm that has the ability to adjust the degree 
of occurrence of spurious short branches and degeneration, which was reported 
in the last paper of Dr. Saito submitted about a month before his death [24]. 

2 Distance Transformation 

2.1 Definition 

Let us consider a 3D digital picture F = {fijk} where fijk is the density (gray 
value) at a voxel (i,j,k). The picture F is called a binary picture if fijk takes 
only densities 0 and 1. We call a set of 0- voxels, So, the background and a set 
of 1-voxels, Si, an object, respectively (Fig. 1). 

Then, the distance transformation (DT) of binary picture F is defined as 
follows: 

Definition 1. DT : F = {fijk} ^ G = {gijk} 

_ ( mm{dist{{i,j,k),{p,q,r));{p,q,r) G Sq], if (i,j,k) € Si, 

lO, tf{iJ,k)G So, ^ > 

where dist{a, b) = distance between two voxels a and b. We call the picture 

G = {gijk} the distance picture. 
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A transformation in which dist'^{a, b) is calculated instead of dist{a, h) above 
is called a squared DT. 

Intuitively, DT is a transformation that gives the distance of each voxel 
to its nearest 0- voxel. Various distance metrics are used as dist{a,b). For 3D 
digital pictures, the 6-neighbour distance, the 18-neighbour distance, and the 
26-neighbour distance are popular, and the Euclidean distance is familiar by the 
analogy of the continuous space and the real world. Weighted DT has also been 
studied for 3D pictures [2,3,14]. For simplicity of description, we write DT using 
the above distance measures as the 6-n DT, 18-n DT, 26-n DT, and Euclidean 
DT (EDT), respectively. 

2.2 Algorithm 

Algorithms to calculate DT for a given 3D binary picture have been studied since 
the 1970’s [2,6,14,25], and efficient algorithms have been found for 6-n, 18-n, and 
26-n DT, including both parallel type and sequential type algorithms. However, 
only a very few have been found for EDT. 

Strategies for an algorithm to obtain EDT are classified as follows: 

1. Record the differences of coordinate values in three axes and perform an 
exhaustive search for voxels in a suitable neighbourhood to find the voxel 
minimizing the Euclidean distance [28]. 

2. Use vector propagation using masks of neighbourhood voxels whose sizes 
often are larger than 3x3x3 voxels [4,6,14]. Both parallel and sequential 
type algorithms are known [4,6,14]. 

3. Calculate the square of EDT instead of the exact EDT value [18,19,20]. 

The first case is the straightforward implementation of distance minimiza- 
tion and easy to understand. The computational load and memory requirement 
rapidly increase for large images and especially when working in higher dimen- 
sions. 

The second method is easily implemented both in parallel type and sequential 
type algorithms. Basically, it gives approximations to the Euclidean distance, and 
the accuracy of an approximation is good with masks of suitable shapes and a 
computation of sufficient iterations. 

The last procedure is to minimize [dist{{i,j,k),{p,q,r))'^] in Eq. (1) with 
the use of an Euclidean metric. This is the simplest procedure and is executed 
most effectively by computer among the three methods. The square roots of all 
obtained values must be calculated in order to know the exact distance values. 
Actually, this is not always necessary in most applications. The relations among 
the distance values often provide enough information for practical problems using 
DT. The extraction of a local maximum (or minimum) and equidistance voxel 
detection are typical examples. In this paper, we treat only EDT. 
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2.3 Reverse Distance Transformation 

Reverse distance transformation (Rev. DT) is a transformation to restore a DT 
picture from a finite set of voxels and the distance values on them. We define it 
as follows. 

[Definition 2] Given a set of voxels S = {si, S 2 , . . . , s^}, where s^ = 
km) Is a voxel with a density value we consider the processing 

that obtains the set of all voxels P such that 



P — j ^ k)]3TTl^ (^i im) {j jm) T ^m) 

— (^m; jm-i km) ^ ®}* 

If is the squared EDT value at a voxel s^, the above processing is 

called the reverse squared EDT. 

The above equation means that for an arbitrary voxel q = (i,j,k) in the 
set P, there exists at least one voxel on which the squared EDT value is 
larger than the squared distance between q and Sm- This also means that the 
0-voxel nearest to is further from Sm than q. Therefore, the voxel q is inside 
an object in an original input picture. 

Rev. DT is effectively performed by an algorithm similar to EDT as shown 
in [21]. The details of the algorithm are skipped here. Reverse EDT is also pre- 
sented in [4,5] . Its strategy is to put the balls of the radii given by the distance 
values on skeleton voxels with small numbers of scans after selecting the min- 
imum set of necessary balls [4,5] or to extend skeleton voxels by an algorithm 
similar to EDT [21]. The comparison of efficiency level has not been reported 
to date. How the voxel set S (=skeleton) can be obtained is presented briefly in 
Section 3. 

3 Skeleton and Skeletonization 

3.1 Definition of a Skeleton 

The words “skeleton” and “skeletonization (= the process to extract a skeleton)” 
have different meanings, and they are not always defined uniquely. 

A skeleton in relation to DT seems to be considered in different ways as 
follows. 

1. The minimum set of voxels such that all input objects are restored exactly 
using their locations and the distance values on them. The term of “mini- 
mum” here means the smallest number of voxels of the set. Practically, the 
detection of the strictly minimal set is not easy, and a suboptimal set is 
utilized. The resultant skeleton is not unique; this is so even if strict mini- 
mization can be achieved. 

2. The set of local maxima of DT G = {gijk}- The size and the shape of 
the neighbourhood used for the determination of the local maxima vary 
corresponding to the kind of connectivity. 
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3. The center line or center surface of an input object. In this case, the preser- 
vation of topological features of the original pictures is strongly required. 

4. A set of center voxels of balls included in an input object and touching the 
border of the object at more than one voxel. 

None of these exactly coincides with the others. Obviously, the first is the 
subset of the fourth. We think the third case should be called thinning (the word 
skeletonization might be acceptable, but confusing) and excluded in discussions 
concerning DT. 

In the case of EDT, the problem is more complicated. The definition of the 
term “skeleton” has not been discussed systematically except in [21]. In this 
paper, we adopt the first case as the skeleton, and discuss its properties. 

Rev. EDT provides a method to restore at least parts of an object from a 
suitable set of voxels with the EDT values on them. Then, the problem is how 
to find the minimum set from which the original object can be restored exactly. 
One available algorithm is the iterative testing of the coverage relation among 
balls and objects as presented in [5,11]. A strict minimization is still difficult and 
we have not succeeded yet in finding a better algorithm for this. Presently, we 
define skeletons of EDT as follows. 

[Definition 3] (skeleton type 1) Given the squared EDT G = {gijk} of a 3D 
picture F = {fijk}, the set of voxels S defined below is called the skeleton 
(type 1) of picture F and the EDT G. 



S = {{ij,k);3{x,y,z),{i- xf + {j - yY + [k - zf < 
and max {guvw] {x - uf - {y - - {z - w)'^ < gijk} = gijk} 

{u,v,w) 

This means that the skeleton is a set of centers of balls with a radius equal to 
an EDT value having at least one voxel as its dominating area. The dominating 
area of a ball C means a set of voxels covered by only the ball C itself. 

We have proposed another definition of the term “skeleton” slightly different 
from the above in [21]. 

[Definition 4] (skeleton type 2) Given the squared EDT G = {gijk} of a 3D 
picture F = {fijk}, the skeleton of G (and F) is defined as the set of voxels 
S' given below. 



S' = {{ij,k)-.,3{x,y,z),{i- x)'^ + {j - yf + {k - zf < gijk, 
and max {guvw ~ {x — u)^ ~ {y ~ v)'^ ~ ~ = 

{u,v,w) 

gtjk -{x- if -{y- jf -{z- kf} 



The underlying idea is as follows. Let us assume that a voxel {i,j,k) is the 
only one skeleton voxel in a given object and its squared distance value is gijk- 
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Fig. 2. Illustration of the skeleton type 2 for the two dimensional case (Pixels P\ 
and P 2 are included in the skeletons. P 3 is not included) 



Then, a squared distance value at a voxel (x, y, z) in this object will be given 
by gxyz = 9ijk — (* — — {j — uY ~ The skeleton in this case is a set 

of centers of elliptic paraboloids, which are not covered perfectly by any other 
elliptic paraboloid. The elliptic paraboloid here is the one having the maximum 
value at its center voxel and the value is equal to the squared distance value of 
the center voxel (Fig. 2 ). 

Examples are shown in Fig. 3. For the details of these definitions including 
intuitive interpretations and examples see [21]. Note that the skeletons defined 
here do not preserve the topologies of the input pictures. 

3.2 Restoration 

Let us assume that a set of voxels and the distance values on them are given. If 
the distance value at a voxel P is dp, then no 0- voxel exists within the distance dp 
from P. In other words, a ball with a radius dp and its center at P is included in 
the input object. Then, a single voxel P is extended (dilated) in all directions in 
the 3D space within the distance of dp from P without going outside the object. 
The result is a part of the original object. 

If a suitable set of voxels is given as seeds associated with the values of the 
degree of dilation, then the original object can be recovered as the sum of the 
sets obtained by the dilation. The dilation procedure is also implemented by a 
simple algorithm similar to EDT as shown in [21]. The skeleton defined above is 
used as the most suitable seed set. 

3.3 Skeleton Extraction and Restoration 

To extract the skeleton, all local DT maxima are first extracted. Then, the next 
step is critical where the set of local maxima has to be reduced without loss of 
the restoration property. 
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(c) (d) 



Fig. 3. Examples of skeletons based upon Euclidean distance transforma- 
tion [21]. (a,b) Type 1 (Definition 3) (c,d) Type 2 (Definition 4) 



As mentioned above, the restoration is performed by Rev. EDT from the 
skeleton given by Definitions 3 and 4. This algorithm is easily implemented by 
an ordinary computer. It is obvious from the definitions that the input object is 
exactly restored by the above procedures from the skeleton. The following items 
are known concerning skeletons. 

1. Skeletons by Definitions 3 (Type 1) and 4 (Type 2) do not coincide with 
each other. The number of skeleton voxels is usually larger for type 2 than 
for type 1. 

2. The Rev. squared EDT shown in [21] does not restore the EDT values, 
although the shape of the input object is recovered. If we need to recover 
the EDT values, the best way currently available is to recalculate the squared 
EDT from the restored object. Since both the squared EDT and Rev. squared 
EDT are fast, the sequence of the processing <C skeleton — > Rev. squared 
EDT — > squared EDT ;g> is not very time-consuming. 
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3. The set of local maxima detected by the 6- neighbourhood or the 26- neigh- 
bourhood does not guarantee the exact restoration of the original object. 
One algorithm [14] is available to extract a skeleton satisfying all restoration 
requirements by simple local operations with a look up table. The algorithm 
to extract the type 2 skeleton is similar to reverse squared EDT and ex- 
tended to higher dimensions straightforwardly [21]. Computation time of 
the extraction of skeleton 2 was smaller than skeleton 1 in our limited size 
of experiments [21]. More systematic study of efficiency is expected in the 
future. 

4. The minimization of the number of skeleton voxels satisfying Definitions 3 
and 4 is difficult except by an exhaustive search. After extracting a set of 
candidate skeleton voxels, the procedure to reduce skeleton voxels is available 
as shown in [5,11]. More efficient algorithm has not been found yet although 
a few suboptimal methods can be found in [21]. 

4 Applications 

The information obtained by using DT includes: 

(a) the distance value between two objects or from a point to objects on a 
picture, 

(b) shape information about the equi-distance surface, 

(c) skeleton, 

(d) how information is propagated. 

The use of EDT instead of the popular 6n-DT or 26n-DT improves the results 
in (a), (b), and (c) significantly. We show several examples in this section. 

The advantages of EDT over other types of DT are first that it gives a real 
Euclidean distance value used in the real world, and second that it is homo- 
geneous in all directions. The development of the algorithms mentioned in the 
previous section has greatly contributed toward making use of these advantages 
in practical image processing. 

4.1 Thinning 

Thinning (which is often called skeletonization) is the processing to extract the 
center line (line thinning) or to decide the center surface (surface thinning) of a 
thick 3D object [7,10,27]. The connectivity of an input picture is kept unchanged 
in the result of a case of thinning. This requires that the resulting line or surface 
object be located at the center (or near the center) of the input object. 

One of popular approaches among algorithms thus far developed is as de- 
scribed below [7,12,13,17,22,23] 

1 . Test the 1-voxels at the edge of an object in terms of deletability, and replace 
them by 0-voxels (deleted) if deletable. There are two possible strategies. 

(a) Delete a deletable 1-voxel at the moment it is detected (sequential type). 



420 



Jun-ichiro Toriwaki and Kensaku Mori 



(b) Examine all edge voxels and give a mark to a deletable voxel. Then, 
delete all of the voxels given marks simultaneously (parallel type). 

2. Apply the above process to only the upper edges first. After processing all of 
the upper edge voxel, process the left edges. Apply the procedure to the lower 
edges, the right edges, the near side edges, and the far side edges sequentially. 
Typically, use six iterations for 3D pictures (subcycle). However, subcycles 
are not always used in some of the thinning process. 

3. Iterate (1) and (2) until no deletable voxel can be found (main cycle). 

A subcycle is needed to proceed with the deletion of 1-voxels at an equal 
rate from all directions and by doing so, to locate thinned results at or near the 
center of an object. 

There are many variations in the details, but most existing algorithms em- 
ploy this type of subcyle - main cycle structure. One problem common to this 
structure is that the results are too sensitive to the rotation (or the direction) of 
the input object. For instance, the thinned result of a 45-degree rotated object 
is often severely deformed from the rotation after thinning at the same angle. 
This problem is greatly improved by utilizing EDT in the above procedure. 

The algorithm is modified as follows [22]. 

The squared EDT of a given input picture is used as the input picture to the 
revised version of the algorithm. (The word “squared” is neglected for simplicity 
of description.) 

1’ Find the minimum of the EDT. In the first step of the revised algorithm 
only the voxels with the minimum EDT value are processed just as the edge 
voxels in (1). 

2’ Apply process (2) to the voxels with the minimum EDT value. 

3’ After completing (2’), process those voxels with the second smallest EDT 
value in the same way as in (2’). 

4’ Iterate the whole of these for all values of EDT, starting from the voxels 
with the smallest value of EDT to the voxels with the largest ones. 

Experimental results are shown in Figs. 4, 5 and 6. Although Ref. [27] is 
often referred as one of the classic papers on 3D thinning, different methods 
have appeared recently [12,13,17,29]. More systematic comparative studies can 
now be expected. Fig. 6(b) was obtained by only deleting the subset of deletable 
voxels (belonging to three-dimensional simplexes), deriving Fig. 6(c) by deleting 
all deletable voxels while keeping the topology unchanged. For a more detailed 
evaluation of the algorithm, it is necessary to compare the algorithm with more 
recently published methods such as [29] . 

The computation time is not affected much by the use of EDT here because 
the EDT is performed only once during the thinning process and its weight is 
not so heavy compared to the remaining parts of the algorithm. 

This algorithm can be combined with anchor point thinning [14], and by 
doing so, we can derive a thinning method with a function to adjust the degree 
of occurrence of spurious branches, which is in one of the latest papers by Saito 
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Fig. 4. Experimental results of thinning by [22,23]. (a) Input object (connected 
balls), (b) Input object (cross), (c,d) Algorithm by Saito et al. [22,23], (e,f) 
Algorithm by [7], (g,h) Algorithm by [27] 
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Fig. 5. Experimental results of thinning rotated pictures (a) Input object rotated 
by 15 deg. and (b) 35 deg., (c,d) Result of thinning by [22,23], (e,f) Result of 
thinning by [7], (g,h) Result of thinning by [27] 
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(a) (b) (c) 



Fig. 6. Thinning experiments (Hand) (a) Input picture (Hand) (b) Result of 
thinning (surface thinning) (c) Result of thinning (axis thinning) 



and his colleagues [ 24 ] . If we want to guarantee the preservation of some voxels 
after the thinning, we can give a kind of mark to each of these voxels before 
starting the thinning regardless of deletability. We call such voxels here anchor 
points. They are never deleted during the thinning process. Accordingly, the 
resultant thinning lines or surfaces always pass these anchor points. The method 
consists of the following steps: 

a. Skeleton extraction 

b. Skeleton selection 

c. Anchor point thinning with EDT 

d. Post-thinning 

The first step (a) is presently performed according to the direct implemen- 
tation of Definition 3 or 4 . The step (b) is the key step of this procedure [ 1 , 24 ]. 
Let us consider two skeleton voxels P\ and P2 with EDT values g± and 52, re- 
spectively (Fig. 7 ). Assuming gi > 52, the skeleton voxel P2 is excluded from 
the anchor point set if both of the following conditions hold: 

[Condition i] A ball with a radius r2 and its center at a voxel P2 is covered 
by an ellipsoid E specified as follows: 

center = Pi, radii=(a ■ gi, gi, gi), where a is a parameter, and the direction 
of the largest radius coincides with the line connecting Pi and P2 ■ 

[Condition ii] gi + g2 P d, where d is the distance between Pi and P2- 

Condition i means that skeleton voxels with small DT values near the skele- 
tion voxels with large DT values are eliminated. Condition ii requires that 
skeleton voxels Pi and P2 belong to the same object. The parameter a can 
be adjusted in each applications. For a = 1 . 0 , no skeleton point is eliminated. 
For larger a, skeletons with larger EDT values are excluded. In addition, the 
occurrence of spurious branches is greatly suppressed, while the possibility of 
degeneration increases in the thinning result. 
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The generation of spurious short branches and the occurrence of degeneration 
or the disappearance of correct short branches are in a trade-off relationship. The 
application determines which branches are false and which are correct. It is dif- 
ficult to give a general mathematical definition of the “correctness” of branches. 
In individual applications, however, it is possible to estimate the relations among 
them as a function of the parameter a like the ROC curve (receiver operating 
characteristic curve) in statistical decision-making by executing a preliminary 
experiment beforehand. We can select a suitable operation point on this curve. 
An example is shown in Fig. 8. 

4.2 Path Finding in Virtual Endoscopy 

With the recent progress in imaging technology in the medical field, a three- 
dimensional image of the human body can now be taken by a computer much 
more easily than before. We call an image a virtualized or virtual human body 
(VHB) [26]. On VHB we can freely navigate the part of the human body we 
want to examine. We call this navigation diagnosis. In particular, flying through 
a pipe structured organ or an organ with a cavity inside (e.g. bronchus, vessel, 
colon, or stomach) can be considered as a kind of simulation of endoscopy, and 
called a virtual (or virtualized) endoscopy system (VES) [8,15] (Fig. 9). 

In a VES for screening use, the automated selection of the path for fly- 
through can often reduce a doctor’s load. Accordingly, we tried to apply the 
thinning algorithm to the bronchus and colon extracted from a 3D CT image 
by segmentation techniques in order to use the center line as the path of the 
viewpoint for an automated VES. Examples are shown in Fig. 10 [9]. Results 
were presently accepted as reasonable by medical doctors, and validation by 
large number of clinical cases remain for future study. 
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Fig. 8. Control of spurious branches (a) Variation in the number of spurious 
branches and degeneration with the parameter a. The horizontal axis = the 
number of spurious branches, and the vertical axis = the numbers of degenerated 
branches. The numerals in the graph show the values of the parameter a. A black 
square at the bottom of the graph shows a result of thinning with EDT by [23] , 
(b) Examples. Input object and results for a= 1.5, 2.0, 3.0, (c) Magnified views 
of a part of branches for each a 
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Fig. 9. Virtualized endoscopy system de- Fig. 10. An example of the auto- 
veloped by the author’s group [8] mated determination of a path with 

the virtualized endoscopy system 
shown in Fig. 9 [9] 



4.3 Analysis of 3D Pathological Images 

The third example is an analysis of a 3D microscope image of human liver. 
^From cross sections of a pathological sample of the liver, we extracted portal 
vein and hepatic vein borders, and reconstructed the 3D micro structure of 
capillary blood vessels in the liver. We then analyzed the shape features of 3D 
patterns or arrangement patterns of vessels by using 3D EDT and a 3D digital 
Voronoi diagram [18]. As one example, we could confirm the constancy of the 
distances to the portal and hepatic veins from an arbitrary point in the liver, 
and could also show characteristics of the relationship between the veins and 
necromatic zones. Figure 11 shows experimental results. 

5 Conclusion 

In this article, we introduced research work concerning distance transformation 
and applications of DT performed by Dr. Toyofumi Saito and his colleagues 
at Nagoya University. First, a Euclidean distance transformation algorithm was 
presented briefly, and then applications of the algorithm to thinning and med- 
ical image processing were shown. Here we explained how the 3D thinning was 
improved by utilizing EDT and presented a method to control the occurrence of 
spurious short branches. 

Acknowledgements 

The authors thank colleagues for their useful suggestions and discussions. Parts 
of this research were supported by the Grant-In-Aid for Scientific Research from 
the Ministry of Education, the Grant-In-Aid for Scientific Research from Japan 




Distance Transformation and Skeletonization of 3D Pictures 



427 




Fig. 11. Analysis of 3D pathological images, (a) 3D reconstruction of a hepatic 
vein in the liver and (b) Distance to the nearest hepatic vein 
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Abstract. We study the limiting behaviour of the repeated application 
of a discrete image operator that belongs to a recently introduced class 
of morphological operators. To this purpose, we describe the implied 
iterative process by a discrete dynamical system. 

The convergence is derived for binary and gray scale images. 



1 Introduction 

Mathematical morphology is an important branch of non-linear signal process- 
ing. It has its roots in discrete geometry and set theory, using as a starting point 
the probing of an image by a small(er) structural element. We refer to [3], [6] 
and [7] for an overview. 

This geometric foundation often facilitates an intuitive interpretation of mor- 
phological operators in terms of geometric concepts and is also extremely useful 
for the derivation of many fundamental properties. 

Despite the vast amount of literature about mathematical morphology, the 
limiting behaviour of the repeated applications of a morphological operator has 
not received a lot of attention. Partly, this is because in many cases it is a trivial 
problem: a large class of operators are idempotent (e.g., openings, closings) or 
decreasing (increasing) functions with a natural lower (resp. upper) bound (e.g., 
erosions, dilations, openings by reconstruction). This papers aims to make a 
contribution for non-trivial cases. 

The organization is as follows: in section 2, we describe the class of mor- 
phological operators, previously introduced in [11], and for which we prove the 
convergence in section 3. Section 4 explains how the results can be extended to 
other morphological operators. 

In what follows, X will denote a discrete image defined on a finite rectangular 
grid; returns the gray level value at position x. In the case of a binary image, 
in order to simplify the notation, X denotes also the set of black pixels, depending 
on the context. The cardinality of a set S is given by [S']; the translation of S 
by t, {s + t,s G S} is denoted by 5'(t) and the symmetric set of S with respect 
to the origin, by S. 



G. Bertrand et al. (Eds.): Digital and Image Geometry, LNCS 2243, pp. 429—438, 2001. 
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2 Robust Morphological Operators 

2.1 The Binary Case 

Our starting point is the classical definition of an opening by a symmetric set S 
(called a structural element) : a pixel x belongs to the opening of a binary image X 
by the structural element 5" iff a: belongs to at least one translation of S totally 
included in X . 

A natural generalization is to weaken the total inclusion requirement to sev- 
eral partial ones, i.e., we require that x belongs to at least r 2 translations of S, 
which have at least r\ pixels in common with the image X . The above can be 
formalized in the following way [9], [11]. 

Definition 1. For a given struetural element S, we define the morphological 
operator as: 



RY\X) = p^^g{p,,^s{X)), ( 1 ) 

where Pr,s{‘) denotes the r—rank operator 

p,, 5 (X) = {x: |.5(,)nA| >r}. (2) 

One observes that Rg’^^^{-), and Rg^{-) correspond re- 

spectively to a closing {X • S), opening {X o S), a double erosion and dilation. 

2.2 The General Case 

One way to extend Def. 1 for binary images to gray scale images is through the 
use of the set of slices X~^{u) = {x, X^ > m}. 

Definition 2. If X is a gray seale image and S a subset of pixels, we define the 
morphological operator as: 

R^'s’^'^X),, = max{u : x e (3) 

Using the geometric interpretation of the operator for the binary case, one ob- 
tains: 

Proposition 1. The value of Rfg''^'^{X) at x eorresponds to the largest possible 
value u such that x is contained in at least r^ translations of S, that have at 
least ri pixels in common with X~^{u). 

^ Also known in the literature as order statistic filter, percentile filter or E filter. 
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2.3 Motivation 

A motivation to introduce the above class of operators was given in [11]. As 
the result of X o S' and X • S is a basic step in the calculation of the Pattern 
Spectrum and is used in many composed operators like Top-hats, it is important 
to have at one’s disposal a good estimator for X o S and X* S, when only N{X), 
a noisy version of X, is available. 

In [11] it was shown that if X(X) is the result of limited (< 5%) pepper and 
salt noise distortion on X, R^g’^^{N{X)) is a robust approximation for X o S 
and X • S (using appropriate values of ri and T 2 ). 

The above should be contrasted with the construction of the Rank-Max fil- 
ters [5] or, equivalently, Regulated Openings [1], where one embeds the classical 
morphological openings in a larger (parametrized) family, conserving the prop- 
erties of an algebraic opening. Although less sensitive to noise than the morpho- 
logical openings, several experiments showed that outperforms them. 

Of course, a drawback is the absence of the idempotency property. In the 
next section, we study the limit behaviour under repeated applications. To this 
end, we define for any value of r\ and r 2 '. 

T\X) = Rg^’^^{X), T\X) = i?s^’'’"(T*-^(X)). 

3 Limiting Behaviour 

3.1 The Binary Case 

Proposition 2. For any X, T*(X) will converge to a fixed image when t tends 
to infinity. 

Proof. It is illustrative to describe the dynamical behaviour of T*(X) by means 
of a cellular automaton. To this purpose, consider an image X of size n x m as 
one long vector of size nm with elements belonging to {0,1}. We define the state 
space of the automaton as a vector Y* with two components, Y = (Yf,Yf), each 
of size nm. The dynamics of Y* is described by: 

= (Pr2,s0^2~^)^ Pri.siY*-^)) Y° = (X, X) with X a given image. (4) 

Consequently, Yf*^ corresponds to T‘(X), the application of t times i?g'^’’'^(-). 
We construct: 

where Ns is the nm x nm matrix: 



{^s)x,y — 



1 if ?/ € 5'(3,) 
0 otherwise. 
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As y € S(^x) iff a; S ^ symmetric matrix. Next, define the vector 

9 = (r 2 , • • • , T 2 , ri, • • • , ri) where ri and V 2 are repeated nm times. 

All together, because of (4), one obtains: 

Y* = l{MsY*-^ -9), (6) 

where l(-) is the vectorized threshold function (i.e. (yi, • • • , y„) = l((a;i, • • • , x„)) 
iff Ui = I {xi > 0) and /(•) denotes the Indicator function). 

As there are only a finite number of elements in our state space, we can 
always find a 02 in such a way that: 

MsY*-^ - 0 > 0 iff MsY*-^ - 02 > 0 

and 

MsY*-^ - 0 < 0 iff MsY*-^ - 02 < 0. 

In the sequel we suppose that 0 = 02 to guarantee that: 

- 0 yf 0. (7) 

Following the derivation in [2] - which for the sake of completeness we in- 
clude here - a Liapunov function is introduced, associating an energy with every 
state Y : 

E{Y^) = ~{Y\Ms ■ Y*-^) + (0,y‘ -b 

where (., .) denotes the usual inner product. 

Using the fact that Ms is a symmetric matrix, 

AE := E{Y^) - E{Y*-^) = - - Yt^){Y^{MsUyY*-^ - 0,)}. (8) 

X y 

By construction each term in the outer sum in (8) is positive, hence AE is 

always non-positive. As the state space is finite, from a certain moment on, E 

remains constant [2]. Using once more that each term in the outer sum in (8) is 
positive, together with (7), AE = 0 implies that there exists a given to, such 
that for t > to ■ 

_ Y^2to 

i.e., one obtains convergence of T‘(-) to a fixed image. □ 

As an illustration, in Fig. 1 the fixed points are shown under repeated appli- 
cation of R^s’^^i') for different parameter values. 

The following property describes the fixed points (images) in the particular 
case where S equals a discrete circle of radius 1 in the city-block metric. 

Proposition 3. If S is a circle of radius 1 in the city-block metric, 

1. the limit points ofR^^{-) are of the form UiRi, where Ri are rectangles with 
a minimum (Euclidean) distance of \/2 between them, i.e.: 

miUr,,eRk,rieRidi'r'k,ri) > \f2, for I k; 



(9) 
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2. the limit points ofR^^(-) are of the form UiRi, where Ri are rectangles with 
a minimum (Euclidean) distance of two between them, i.e.: 

minr,^eR^^rieRid{rk,ri) > 2, for I k. 

Proof. 

1. It is sufficient to show that the configurations of 2 x 2 pixels, shown in 
Fig. 2(a), can not form part of a fixed point(image). These configurations repre- 
sent the only possible situations that can not be written as the union of rectangles 
satisfying (9). 

Suppose the opposite holds; independent of the values of the neighbouring 
points, the application of to any of the configurations of Fig. 2(a), will 

give rise to the configuration shown in Fig. 2(b) causing a contradiction with the 
fact that Fig. 2(a) forms part of a limiting point. 

In Fig. 3 a limit point is shown that illustrates that the lower bound of the 
distance between the rectangles can be attained. 

2. In this case, we obtain a classical 2— threshold cellular automaton for which 
Prop. 4.5 of [2] can be applied. 

□ 



3.2 The General Case 

In order to extend Prop. 2. to gray scale images, we make use of the concept of 
an umbra, defined as [8]: 

Um(X) = {(x,u) : u < X^}. (10) 

If we abbreviate {(a,x), a € A} by {A,x) with A an arbitrary set, an umbra of 
an image is equivalent to: 

Um{X) = ^u{X-\u),u). (11) 

We denote by the operator that maps Um{X) into X. 

Using the above, we can associate with any order preserving set function /(•) 
(i.e., A C B implies f{A) C f{B)) an operator !F{-) on X in the following way: 

E{X) = Um-\Uu{f{X-\u)),u)). 

The following lemma shows how the result of a composition of certain func- 
tions acting on an image, is related to the calculation of a composition of corre- 
sponding functions acting on each slice of the image. 

Lemma 1. Suppose that /(•) and g(-) are two order preserving set functions. If 
we define T{-) and ^(-) by: 

E{X) = Um-\Uu{f{X-\u)),u)), g{X) = Um-\Uu{g{X-\u)),u)), 



then 



E{g{X)) = Um-\Uu{f{g{X-\u))),u)). 



1. (a): th( 
:k denotes 
imit of R 
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Fig. 3. A limit point of composed of four rectangles, the two in the 

middle are separated by a distance of 



Proposition 4. For any structural element, S, repeated applications of {■) 
will converge to a fixed image. 

Proof. If /(•) is an order preserving set function and Z a gray scale image, we 
have the following equivalence, which is easy to demonstrate: 

Um{Z) = Uu{f{X~^{u)),u) iff Vx : Za; = max{it : a: € /(A“^(m))}. (12) 

Hence, Def. 2 is equivalent to: 

= Uu{p,^siPrus{X-\u))),u). (13) 

Using Lemma 1., if we choose /(•) = g{-) = ’’'^(•), then repeated applications 

of ’’'^(■) is equivalent to repeatedly applying this operator to each slice and 
taking the union (or equivalent the supremum) at the end. For each slice. Prop. 2 
guarantees convergence. □ 

An example is shown in Fig. 4 for different values of ’^^(-). As one observes 
in Fig. 4(d), the limit point is not sensitive to the presence of low salt and pepper 
noise in the original image. 

Although for all the above examples convergence was reached in less than ten 
iterations, a considerable speed-up (25%) of the calculation time was obtained 
using a moving-histogram algorithm to calculate pr,s{X), instead of its naive 
direct implementation. The basic idea of the moving -histogram algorithm [10] is 
to update the histogram of the gray values of the pixels covered while sliding 
the structural element over the image. This is usually a much smaller set than 
all pixel values, the update step is extremely efficient and the value of pr^s{X) 
can be easily obtained. 

In the particular case of binary images, other alternatives are available. We 
refer to [12] for a recent overview. 

4 Extensions to Soft Morphological Filters 

The above construction is not only useful for the class of operators introduced in 
section 2. As an example, we consider the following class of soft morphological 



436 Johan Van Horebeek and Ernesto Tapia-Rodriguez 




(c) (d) 

Fig. 4. (a): the given image X; (b): limit of (c): limit of R^’J{X); (d): 

limit of Rg’J {N{X)) where N{X) denotes X corrupted by 1% salt and pepper 
noise and S'!, S 2 denote a circle of radius 1 resp. a 3 x 3 rectangle 
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filters (several definitions appear in the literature; we follow the one of [4]). For 
the sake of simplicity, we restrict ourselves to soft morphological openings for 
binary images. 

Definition 3. [4] Given two symmetric sets , S'^ , C S'^ and r < \S^ \ 

S^\, define the soft morphological opening, X o [5”^ , 5*^ , r] , as: 

Xo[S^,S^,r] = {Xe[S^,S^,r])(B[S^,S^,r], (14) 

where {X © [5'^, {X © [5”^, 5''^, r])a; denote the rth smallest, respec- 

tively, rth largest value of the multiself (r o n X)) U \ S'^)(^,c) F X), 
and where o denotes de repetition operator (roS equals r times the multiset S). 

In a similar way, a soft morphological closing is defined. 

Proposition 5. The repeated application of a soft morphological opening (clos- 
ing) will converge to a fixed image. 

Proof. Only minor changes in the proof of Prop. 2 have to be done. The dynamics 
can be described by a dynamical system similar to (6). In (5), we should now 
define: 

r r if 2 / G 5^) 

(iVs)x,,= <^ 1 if 2/ 

[ 0 otherwise. 

As Ns is also symmetrical, the same reasoning of the proof of Prop. 2 applies 
and the same conclusions hold. 

□ 



5 Conclusions 

In this paper we have proven that the repeated application of a certain class of 
discrete image operators converge to a fixed image. Although only some modest 
but promising results concerning the characterization of those limiting points 
were included, a lot of work still has to be done. 

Some open questions are: For which parameter values of ri,r 2 do we obtain 
a union of convex sets as the limit image? When do we obtain a non-trivial limit 
(i.e., an image that is not completely black or white)? How do we obtain an tight 
upper and lower bound of this image? How do we obtain a good upper bound 
of the convergence speed? 
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Abstract. We discuss the optimal grouping of collinear line segments, 
where we use a digital form of collinearity as the grouping criterion. We 
also introduce weak collinearity as a new concept to facilitate the design 
of an efficient line grouping algorithm. This algorithm is based on the 
use of simplicial elimination orderings, as known in graph theory. 



1 Introduction 

Extracting groups of lines according to some geometric criterion such as collinear- 
ity or parallelism is an important problem in perceptual grouping. In this paper 
we discuss optimal grouping with respect to digital collinearity, a property de- 
fined within the framework of geometry in the digital plane. To increase the effi- 
ciency of the grouping process, we also introduce the concept of weak collinear- 
ity as an approximate form of digital collinearity. Using weak collinearity, we 
propose one particular grouping algorithm, which is based on simplicial vertex 
elimination, an idea borrowed from graph theory. 

In computer vision the partitioning of objects into groups according to some 
similarity criterion is viewed as a clustering problem. We can make the distinction 
between vectorial based clustering and relation based clustering [5] . In vectorial 
based clustering, features are represented as vectors in an n-dimensional feature 
space, e.g., a feature may consist of the slope, length, and midpoint of a line 
segment. The distance between feature vectors is used as a measure of similarity. 
Alternatively, relation based approaches introduce the strength of a relation as a 
similarity measure. For example, Lowe as well as Nacken introduce a metric that 
measures how far two line segments are from being collinear [6,7]. The relation 
based approach has the advantage of greater flexibility as for the choice of an 
appropriate function that measures the similarity between features. A recurring 
problem, however, is that such a function does not necessarily satisfy the triangle 
inequality, a desirable condition when the criterion used for grouping is based 
on a transitive property, as is the case for collinearity [7]. 

In this work we propose a relational approach that is still strongly linked 
with the feature vectors. Instead of using distances between feature vectors we 
will embed each feature vector into a region called an uncertainty region, and 
we will use the intersection properties of the uncertainty regions to determine 
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which vector pairs represent collinear line pairs. Although we loose some of the 
flexibility of a purely relational approach, and the simplicity of purely vectorial 
approach, there are some important advantages: 

— We have a coherent framework in which we can define geometric properties 
such as digital parallelism, collinearity and concurrency; in each case, line 
grouping can be formulated as a combinatorial optimization problem. 

— Line grouping is hierarchical process, where lines are combined into groups, 
and groups of lines into higher level structures; by using uncertainty regions 
hierarchical grouping becomes a natural process; 

— One of the main conclusions of this work is that the failure of transitivity 
can be related to other issues such as the non-perfectness of graphs; the use 
of uncertainty regions better explains this relation. 

The digital versions of collinearity, parallelism and concurrency have been 
studied before [9] . In previous work on line grouping we focused on the grouping 
of parallel lines. Grouping parallel lines is greatly facilitated by the fact that 
parallelism involves only one parameter, i.e. the slope of a line. In fact, one can 
show that parallelism satisfies a one dimensional Helly type property [9], and 
that the parallel relationships present in an image can be represented by an 
interval graph. For interval graphs we have efficient combinatorial optimization 
algorithms, leading to fast algorithms to extract the geometric structure. For 
example, once the interval graph has been constructed for a collection of N 
lines, there are algorithms of 0(iV log iV) time complexity for extracting the 
largest set of parallel lines, as well as for the optimal partitioning into parallel 
subgroups, or for finding the largest set of non-parallel lines. 

Collinearity on the other hand involves at least two line parameters, slope 
and height, and, not surprisingly, it is more difficult to solve geometric opti- 
mization problems for collinearity than for parallelism. There are two causes for 
this difficulty. First, digital collinearity satisfies a two-dimensional Helly type 
property [9], and as a result the appropriate combinatorial structure for repre- 
senting collinearity is a hypergraph. Second, under certain assumptions, we can 
replace collinearity by the simpler property of “weak collinearity” , whose rela- 
tionships can be represented in an ordinary graph. However, a weak collinearity 
graph is the intersection graph of a collection of two-dimensional convex sets, 
and by nature this kind of intersection graph is more difficult to process than the 
intersection graphs of one-dimensional intervals, which represent parallelism. 

In this paper, to illustrate the notion of weak collinearity, we restrict our- 
selves to one particular geometric problem, i.e., partitioning a collection of lines 
into a minimum number of (weakly) collinear groups. This problem is trivial 
in Euclidean geometry, but not in the digital plane. First, we reformulate the 
partitioning problem as a well posed, but NP-complete optimization problem in 
graph theory. Next, we propose a heuristic algorithm that often finds an optimal 
solution. Although the proposed algorithm fails occasionally, one of its advan- 
tages is that we know when the solution is optimal. In fact, the circumstances 
of failure can be related to the occurrence of certain line configurations in the 
digital plane. 
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Finally note that in this work we use the “discretization by dilation” scheme, 
which has been formalized by Heijmans and Toet [4] . One of the advantages of 
using the discretization by dilation scheme is that it leads to a number of well 
defined combinatorial optimization problems [10,11,12]. This advantage will also 
be exploited in this work. 

Section 2 gives a brief overview of properties in the digital plane. In Section 3 
we introduce the notion of weak collinearity, and we briefly consider the major 
optimization problems with respect to collinearity as well as the major obsta- 
cles that arise when designing efficient algorithms to solve these problems. In 
Section 4 we propose a simplicial grouping algorithm to solve the problem of 
partitioning a set of lines into a minimum number of collinear groups. Finally, 
in Sections 5 and 6 we give experimental results, and we conclude the work. 



2 Geometric Properties in the Digital Plane 

In this section we recall some results obtained in previous work [9,10,12]. Since 
image pixels lie on a rectangular grid, we assume that all pixels are part of 
the digital plane 2Z^ ^ although this is not strictly necessary for what follows. A 
digital set S' is a subset of the digital plane. 

We model uncertainty by an uncertainty region that we associate with each 
grid point. The discretization process that coincides naturally with this notion of 
uncertainty is the discretization by dilation scheme developed by Heijmans and 
Toet [4]. Let U denote a set in IR^, let A be a second subset of IR^, called the 
structuring element, and let Ap be the translate of A by p. Then the discretization 
by dilation of U consists of all points p G for which Ap n 17 is non-empty [4] . 

Furthermore, to simplify the exposition in this paper we shall restrict our- 
selves to one particular form of discretization by dilation, that is, we use a simple 
variant of grid-intersect discretization. To model the uncertainty of its position, 
for each digital point p = {x,y), we introduce as translate of the structuring 
element the vertical line segment Cp(t), which comprises all points (x, 6) G IR^ 
that satisfy y — t/2 < b < y + tI2. Here r is a positive real number, called 
the acceptable thickness. To avoid confusion between a vertical “segment” and 
a digital straight line “segment” we shall call a vertical segment an uncertainty 
segment. Up to a certain multiplication factor, the notion of acceptable thickness 
coincides with Reveilles’s notion of arithmetical thickness of a discrete straight 
line, also discussed by Andres [8,1]. To simplify what follows even further, we 
shall only discuss the extraction of straight lines of the form y = ax + (3, where 
the slope a satisfies — 1 < a < 1. We assume that each set S contains at least 
two points with distinct x-coordinates. 

Although the foregoing restrictions do not change our results in a fundamen- 
tal way, they allow us to discard several special cases discussed more thoroughly 
in previous work [9], where we show, for example, that for lines whose slope lies 
between -1 and 1, vertical uncertainty segments are sufficient to model position 
uncertainty. 
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Fig. 1 . Digital collinearity and parallelism 



Definition 1 . A digital set S is called digitally straight if there is a continuous 
straight line that cuts all uncertainty segments C'p(r), p G S. 

A finite digitally straight set will also be called a digital straight line segment. 

Definition 2 . Let Si, . . . ,Sn a finite collection of finite digitally straight sets. 
We define the following digital geometric properties: 

— The sets {Si : i = l,...,n} are called digitally collinear if there exists a 
common Euclidean straight line A that cuts the uncertainty segments of all 
the sets Sp, 

— The sets in {Si : i = 1, . . . , n} are called digitally parallel if there exist n 
Euclidean straight lines Ai, . . . , A„ that are parallel and such that for i = 
I,. .. ,n the line Ai cuts the uncertainty segments of the set Si; 

Thus the sets Si are digitally collinear when their union is digitally straight. 
From now on, we shall often call two digital sets simply “collinear,” when it is 
clear we mean digitally collinear. Digital collinearity and parallelism are illus- 
trated in Figure 1. Si and S 2 are collinear since their uncertainty segments are 
cut by the common Euclidean line A. Likewise, S2 and S3 are collinear because of 
the common line transversal B. Moreover, S 2 is digitally parallel with 5 * 4 , since 
B\\C in the Euclidean sense. Finally note that although is collinear with S 2 , 
S2 with S3, as well as Si with ^ 4 , is not collinear with S3 because there is no 
common line cutting all their uncertainty segments. For the same reason is S '4 
not collinear with S2, nor with S3. Evidently, in the digital plane collinearity and 
parallelism are not transitive. In [9] the following result is derived as a Helly-type 
theorem: 

Theorem 1 . Let S = {Si, . . . , Sn} a finite collection of finite digital straight 
line segments, with n > 2 . Then the collection S is digitally collinear if each 
suhcollection containing three sets of S is digitally collinear. 

We shall need Theorem 1 when we reformulate optimal grouping as a combina- 
torial optimization problem. 
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Definition 3. Let 5 be a finite digital set that contains at least two points with 
distinct x-coordinates, and let r be a chosen acceptable thickness with 0 < r. 
Then the domain of S, denoted as doni3;(5'; r), is the set of all parameter points 
(a, (3) € that satisfy the following system of inequalities: 

- t/2 < axi + /3 - yi < t/2, (xi,yi) e S. (1) 

In other words, the domain contains the parameters of all the Euclidean lines 
that cut the uncertainty segments of S. It follows almost immediately that a 
collection of digital line segments is digitally collinear if and only if their domains 
have a common non-empty intersection [9,10]. 

As an example, consider the collection of digital sets representing lines in 
Fig. 2(a), whose domains are shown in Fig. 2(c), for an acceptable thickness 
T = 1. A domain of a digital set with N points is a convex bounded set, and can 
be computed in 0{N log N) time as an intersection of 2N halfplanes [10]. When 
we let r vary, a domain gets larger for increasing values of r. If the acceptable 
thickness is too small, however, the domain of a set will be empty. For a given 
set S, let I? C IR be the set of acceptable thicknesses for which the domain of S 
is non-empty. The thickness of S is defined as the infimum of D [9]. 

We also need some basic notions of graph theory (see, e.g., [13]). Let G 
be a simple graph, i.e, a graph that contains no loops or double edges. An 
independent set is a set of vertices that does not contain any pair of adjacent 
vertices. A coloring assigns an index to each vertex so that two adjacent vertices 
always receive distinct indices. An optimal coloring uses a minimum number of 
indices, i.e., the chromatic number of the graph. A graph is perfect if each of 
its induced subgraphs has a chromatic number equal to the size of its largest 
complete subgraph. A graph is chordal or triangulated if the only cycles that 
occur as induced subgraphs are triangles. 



3 Collinearity and Weak Collinearity 

Given a set of digital line segments, we shall represent collinear relationships in 
two different ways. 

Definition 4. Let C be a collection of digital straight line segments. In the weak 
collinearity graph G(V, E) each line segment in C is represented by a vertex. Two 
vertices in G{V, E) are joined by an edge if the two corresponding line segments 
are digitally collinear. The collinearity hypergraph H(V,E) also has one vertex 
for each line segment in G. The edge set E consists of all 2-membered and 
3-membered subcollections that are digitally collinear. The complement of the 
collinearity hypergraph is defined as the hypergraph H'(V,E) with one vertex 
for each line segment in C, and whose edge set E of H'{V,E) consists of all 
2-membered and 3-membered subcollections that are not digitally collinear. 

A collection of digital line segments is called weakly collinear if the cor- 
responding vertices induce a complete subgraph in the weak collinearity graph 
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(a) (b) 




Fig. 2. (a) A set of lines, (b) the weak collinearity graph, and (c) the domains 



G(F, E). Figure 2(b) shows the weak collinearity graph of the lines in Figure 2(a). 
Two vertices in Figure 2(b) are adjacent if the corresponding domains in Fig- 
ure 2(c) have a non-empty intersection. The collection of line segments A, B, C, 
and D is weakly collinear, since they induce a complete subgraph in Figure 2(b). 

We shall concentrate on optimal grouping, i.e., to find the minimally sized 
collection of collinear groups. Due to Theorem 1 this problem can be solved with 
the collinearity hypergraph or its complement. To be precise, optimal grouping 
corresponds to an optimal coloring of the complement of the collinearity hyper- 
graph. The edges of the complement H'{V,E) are formed by all non-collinear 
pairs and triples. In a hypergraph coloring no edge is monochromatic. Thus, each 
color corresponds to a group that contains no non-collinear pairs or triples, or 
in other words, in each group all pairs and triples are collinear. By Theorem 1 
it follows that the entire group is collinear. 

Optimal grouping with the collinearity hypergraph is a difficult combinato- 
rial problem. In this work our goal is to solve the optimal grouping problem with 
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the weak collinearity graph, or at least to find a suboptimal solution. To appre- 
ciate the introduction of weak collinearity we note that weak collinearity suffices 
to solve at least one geometric optimization problem: to find the maximum set 
in which no pair is collinear. To solve this problem, it suffices to look for the 
maximum independent set of the weak collinearity graph. Also other optimiza- 
tion problems benefit from the introduction of weak collinearity, but we shall 
not consider them here. 

As for optimal grouping, the introduction of weak collinearity is not without 
cost, however. Although a collinear collection of digital line segments is always 
weakly collinear, a collection that is weakly collinear may not be collinear. In ad- 
dition, although weak collinearity simplifies problems, a weak collinearity graph 
may still be difficult to handle. In particular, a weak collinearity graph may not 
be chordal, or it may even fail to be perfect. In fact, if we are lucky it may happen 
that for a given collection of line segments all weakly collinear subcollections are 
also collinear, which means that we can use the weak collinearity graph to solve 
the optimal grouping problem. However, even in this case, if the weak collinearity 
graph is not perfect, then the grouping problem is still NP-complete [3]. On the 
other hand, we may have a collection of line segments in which some triples are 
weakly collinear, but not collinear. Nonetheless, the weak collinearity graph can 
be still be chordal, so that we have an efficient algorithm to solve the grouping 
problem for weakly collinear groups. 

Although the above difficulties seem to be different, closer examination shows, 
however, that each of the above anomalies arises from a similar cause, i.e., the 
existence of a cyclic configuration of the domains. We start with an example 
without anomalies. Figure 3(a) shows a collection of four line segments and 
their domains. In this case, the domains have a common non-empty intersec- 
tion, the entire set is collinear, and collinearity coincides in a trivial way with 
weak collinearity. By contrast. Figure 3(b) shows three line segments and their 
domains. Each pair of domains has a non-empty intersection. The three domains 
do not have a common intersection, however. That is, although the three line 
segments are weakly collinear, they are not collinear. The difference between 
collinearity and weak collinearity reflects itself in a cyclic configuration of three 
domains that circle a hole in the parameter space. Figure 3(c) shows four line 
segments and their domains. In this case we have a cyclic chain of four domains 
that circle a hole. The weak collinearity graph is a chordless 4-cycle, and is 
therefore non-chordal. Finally, Figure 3(d) shows five lines. The weak collinear- 
ity graph of this set is a chordless 5-cycle, and the weak collinearity graph is not 
perfect. In fact, according to Berge’s strong perfect graph conjecture a graph is 
perfect if and only if the graph neither has an induced subgraph that is an odd 
cycle of length at least five, nor the complement of such a cycle. In each of the 
cases (b), (c) and (d), an anomaly arises from the existence of a closed chain of 
domains that circle an empty hole. 
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Fig. 3. (a) Normal case , (b) weak collinearity differs from collinearity, (c) non- 
chordal graph, (d) non-perfect graph 




4 Simplicial Grouping 

The problem of finding an optimal grouping of (weakly) collinear line segments 
is the same as finding a minimum clique covering for the weak collinearity graph, 
which in turn is equivalent with finding an optimal coloring of the complemen- 
tary graph. Finding a minimum clique covering for a graph is NP-complete. Our 
goal is therefore to find a suboptimal algorithm, and we start with the following 
naive approach. First, we select one vertex at random, and then we look for 
the maximal clique that contains it. Next, after removing this maximal clique, 
we repeat the process for the smaller graph. There is no efficient procedure, 
however, for selecting a vertex whose maximal clique is also a maximum clique, 
because finding a clique of maximum size is in itself an NP-complete problem. 
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In fact, even finding the maximal clique that contains the selected vertex may 
be surprisingly difficult, when the degree of the vertex is large, since this prob- 
lem is equivalent to finding the maximum clique of the graph induced by the 
neighborhood of the vertex. 

Because the naive approach fails, we propose a more sophisticated method 
based on work by Gavril [2]. Gavril has shown that we can find a minimum 
clique covering if instead of selecting vertices at random we restrict ourselves to 
simplicial vertices. The following definition is taken from West [13]. 

Definition 5. A vertex of a graph is simplicial if its neighborhood (not including 
the vertex itself) in G induces a clique. A simplicial elimination ordering is an 
ordering vi, . . . ,Vn in which vertices can be deleted so that each vertex is a 
simplicial vertex of the remaining graph. A simplicial construction ordering is 
the reverse of a simplicial elimination ordering. 

One can show that a graph is chordal if and only if it has a simplicial elimina- 
tion ordering [2]. Gavril has introduced the following minimum clique covering 
algorithm for chordal graphs. Let v\, . . . ,Vn be a simplicial elimination order- 
ing, let Vi denote the neighborhood of Vi with Vi included, and let Gi denote 
the graph induced by all vertices that are in G but not in one of the neigh- 
borhoods Vi, . . . , Vi-i- We construct a sequence of neighborhoods Yi, ... ,Yk as 
follows: Yi = Vi, and L) = Vj where Vj is the first vertex in the ordering vi, ... ,v„ 
that is still contained in Gi. Then Y\, . . . ,Yk is a minimum clique covering. 

Gavril’s algorithm is intended for chordal graphs, but we shall extend the 
approach to graphs in general. We introduce the following algorithm, which 
constructs a sequence of cliques of G. 

Simplicial Grouping Algorithm. 

Input: a weak collinearity graph G{V,E). 

Output: a sequence T of disjoint cliques of G, and a sequence U of independent 
vertices. 

1. Let G' = G, i = 1, and initialize T and U as empty sequences. 

2. If G' has no simplicial vertices, then stop and return the current sequences T 
and U. 

3. Select at random a simplicial vertex qi of G', and let Qi be the vertices of 
the clique formed by qi and its neighbors. 

4. Append Qi to the sequence T, and append qi to the sequence U. 

5. Replace G' by the graph induced by all vertices of G' that are not in Qi. 

6. Go to the second step. 

It is important to note that the sequence U = {qi,q 2 , . . .} does not have to 
form a simplicial elimination ordering. In a simplicial elimination ordering each 
vertex is or becomes simplicial when all preceding vertices have been deleted. 
In the simplicial grouping algorithm, a vertex is or becomes simplicial when all 
preceding vertices and their neighbors have been deleted. 
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The output of the simplicial grouping algorithm is a finite sequence of 
cliques For some graphs, the vertices in this sequence will cover 

the entire graph. There are graphs, however, for which the algorithm terminates 
prematurely because no more simplicial vertices can be found. Now suppose 
a graph is not chordal, but that the simplicial grouping algorithm nonetheless 
succeeds in grouping all vertices in the graph G. Then we can show that the 
sequence Qi, . . . , Qm is in fact a minimum clique covering. We start with the 
following simple result, whose proof is almost immediate. 

Lemma 1. Let G(V^ E) be a weak collinearity graph, and let H{V,E') he a sec- 
ond graph with the same vertex set, and with E' C E. Let V he a minimum 
clique covering of G{V,E). Lf V is a clique covering for H{V,E'), then it is a 
minimum clique covering for H(V, E'). 

Proof. Clearly, any clique covering for H is a, clique covering for G. If H can be 
covered with fewer cliques than the number of cliques in V, then G also can be 
covered with fewer cliques and V would not be optimal. 

Proposition 1. The simplicial grouping algorithm yields a minimum clique cov- 
ering V = T of the graph G if it succeeds in grouping all vertices of G, i.e., if 
all vertices occur in the sequence T . 

Proof. Let Qi, . . . , Qm be the output of the simplicial grouping algorithm cov- 
ering all vertices in G. Let U = {qi, ... ,qm} he the simplicial vertices selected 
during the grouping process. Clearly, U is an independent set. Let Vm+i, ■ ■ ■ ,Vn 
be the vertices of G that are not in U. Let K the complete graph with ver- 
tices Vm-i-iy • ■ • Wrtj 3'iid let H be the union of the graphs K and G. We claim 
that P[ is chordal and that qi, . . . , qm, Vm-ei, ■ ■ ■ ,Vn is a simplicial elimination 
ordering for H. In fact, no pair of vertices in U is adjacent, and each vertex in Lf 
is simplicial in H. It follows that H is a chordal graph, and that the collection C 
of all cliques of neighborhoods Qi of the vertices qi forms a minimum clique 
covering of H . By Lemma 1 it follows that C is also a minimum clique covering 
of G. 

Hence, if it succeeds in grouping all vertices, the simplicial grouping algorithm 
produces an optimal clique covering, even for non-chordal graphs. Next, we show 
that for chordal graphs the algorithm will in fact be able to group all vertices, 
although the algorithm is not guided by a simplicial elimination ordering, as was 
noted previously. 

Proposition 2. Let G be a simple chordal graph, then the simplicial grouping 
algorithm is able to group all vertices of G, i.e., all vertices occur in T. 

Proof. Let v\, ... ,Vn be any simplicial elimination ordering of G. If a vertex Vi 
is simplicial in a graph G, then it is simplicial in any induced subgraph of G. 
This follows from the simple fact that the neighbors of Vi still form a clique in 
any induced subgraph of G. Therefore, a simplicial elimination ordering is still 
a simplicial elimination ordering in any induced subgraph G' if we discard in 
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the ordering vi,V2, ■ ■ ■ all vertices that are not in G' . In particular, assume that 
the simplicial grouping algorithm starts by removing the simplicial vertex Vi, 
i ^ 1 and its neighbors, then v\ will still be a simplicial vertex in the new 
graph G' . As a result, after a finite number of steps v\ will also be removed 
from the graph. Likewise, sooner or later any of the vertices Vj in the simplicial 
elimination ordering will become simplicial in the remaining graph. 

From Propositions 1 and 2 it follows that for chordal graphs the simplicial 
grouping algorithm produces a minimum clique covering. However, since the 
simplicial vertices that are removed by the simplicial grouping algorithm can be 
randomly chosen from the set of simplicial vertices, the algorithm will produce 
different sequences T on different runs. We must therefore show that even when 
the algorithm terminates prematurely, it always removes the same set of vertices, 
although they may be classified in different groups. We need the following result. 

Lemma 2. Let G be a simple graph, and let u and v be two simplicial vertices 
of G. If u and v are adjacent then they belong to the same maximal clique. 

Proof. Since u is simplicial, u and its neighbors form a maximal clique, which is 
identical to the maximal clique formed by v and its neighbors. 

Proposition 3. Let G be a simple graph. Let T\ and T2 be the output sequences 
of the simplicial grouping algorithm after two different runs. Then the T\ and T2 
cover the same subset of vertices of G, i.e., each vertex that occurs in T\ also 
occurs in T2, and vice versa. 

Proof. Suppose we run the grouping algorithm twice. Let Y = {{yf, . . . ,yj}, 
{yf,...,y^}, {yf be result of the first maximal grouping process, and 
let V = ...,}, {vf , be the result of the second grouping process. 

Furthermore, without loss of generality, we assume that the groups have been 
ordered in such a way that in each group the vertex y^ is a simplicial vertex, 
that is, after the vertices of all previous groups have been removed from G. 

It is sufficient to show that for any group Yk = {y^, . . .} in Y, each vertex in Yk 
also belongs to some group in V. We first consider the group Yi = {yf, . . . ,yl}. 
Since this is the first group in Y, the vertex y^ is simplicial in G and in any of its 
induced subgraphs that contain yf. First, assume that yf has not been removed 
by V. This is impossible since it would mean that there is still a simplicial 
vertex in G, and it would follow that V is not maximal. Second, suppose y^ 
has been removed by V. Then there is a group Vi = {v \, . . .} containing y{. By 
Lemma 2 , Vi also contains all the vertices in Y\, that have not yet been removed 
by the groups Vi, . . . , Vi-\. In fact, v\ and yf are simplicial and adjacent to each 
other, and so they belong to the same clique. 

Next, we consider Y2. When we remove the vertices in Y\ from G, the ver- 
tex yl becomes simplicial. Since all vertices of Yi have been removed by V, there 
must be a group Vj, j < I in V such that either by removing Vj also yf and 
all its neighbors are removed, or that after removing Vj, the vertex yj becomes 
simplicial. By the same arguments as before, it follows that all vertices of Y2 are 
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removed by V. By induction it follows that the vertices in each group Yk are 
removed by V. 

Summarizing, the simplicial grouping algorithm will produce an optimal re- 
sult for chordal graphs. For graphs that are not chordal, the algorithm yields 
an optimal result if it can group all vertices, which is often the case. Finally, 
for graphs for which the algorithm terminates prematurely, we know that it has 
removed as many vertices as possible, and although it is a random algorithm, 
running it twice will yield the same result. In fact, even when the algorithm 
terminates prematurely, we can derive upper and lower bounds for the size of a 
minimum clique covering. Let x(G) denote the chromacity of the complement 
of G. Hence, x(G) is the size of a minimum clique covering of G. Let Gi and G 2 
be two induced subgraphs of G, whose vertex sets partition the vertex set of G 
into two parts. Then x(G) < x(Gi) -|-x(G 2 ). Furthermore, for any induced sub- 
graph Gi of G we have x(Gi) < x(G). Let H be the subgraph of G induced by 
all the vertices that can be removed by simplicial grouping. It is clear that H 
can be simplicially grouped and that this grouping is optimal. Let F denote be 
the subgraph of G induced by those vertices of G that are not in H. Then 

x(H)<x(G)<x(H) + x(F)- 

Therefore, since the simplicial grouping algorithm is optimal for those vertices 
that have been removed, we know that if the algorithm is able to remove many 
vertices, i.e., when xi^) is small, the result is almost optimal. 



5 Experimental Results for Random Sets 

Although we have shown that simplicial grouping gives optimal or suboptimal 
results, we must still examine how it performs in practice, i.e., how many times 
it yields optimal results, and if the results are not optimal, how close they are 
being optimal. There is an intuitive reason for using simplicial grouping: weakly 
collinear line segments tend to occur in clusters. In fact, a weak collinearity 
graph is not a completely random graph. If A is collinear with B, and B collinear 
with G, then the probability that A and G are also collinear is higher than in 
a random graph. Thus clustering works both ways: it increases the probability 
of cycles, but it also increases the probability of chords in cycles. The grouping 
algorithm performs well when most cycles have many chords. 

The real performance of the grouping algorithms must be examined for real 
images. The random set of line segments in Figure 4(a) was generated as follows. 
We generated 250 random Euclidean line segments in 50x50 square. The slopes 
were uniformly distributed between -1 and 1. The length of the line segments 
was chosen such that after digitization we would have line segments of 8 pixels. 
After removing all line segments that were not collinear to at least one other line 
segment, 149 line segments were left, which are shown in Figure 4(a). Figure 4(b) 
shows the weak collinearity graph of this set, with 149 vertices and 163 edges. 
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Fig. 4. (a) A random set of line segments, (b) the weak collinearity graph, (c) 
the domains, (d) what remains of the weak collinearity graph after simplicial 
grouping, and (e) the largest groups 



Note that Figure 4(a) contains a large number of weakly collinear triples (35), 
but that only about one third (13) of these triples are collinear. 

For line segments of 8 pixels, each digital segment can be represented as the 
digitization of a line segment with slope fc/7, k = —7, —6, . . . , 7. This explains the 
regular pattern of vertical strips shown by the domains in Figure 4(c). Figure 4(d) 
shows an induced subgraph of the graph in Figure 4(b). It contains 47 vertices 
that cannot be grouped by simplicial grouping. Figure 4(e) shows the largest 
groups that have been found by the grouping process. The groups A,B,C,D,E 
have been found by simplicial grouping of the weak collinearity graph. The group 
F was found by greedy grouping of the vertices that remained ungrouped after 
simplicial grouping, i.e., the vertices shown in Figure 4(d). 
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6 Concluding Remarks 

A set of line segments is digitally collinear if their domains have a common 
non-empty intersection. Since the domains are two-dimensional convex sets sat- 
isfying a Helly-type property (Theorem 1), the optimal grouping problem for 
collinear line segments can be formulated as a combinatorial optimization prob- 
lem. To solve this problem we have introduced the notion of weak collinearity. 
Weak collinearity enables us to replace the collinearity hypergraph by an ordi- 
nary graph that represents the weakly collinear relationships. With the weak 
collinearity graph we have described a grouping algorithm that often produces 
optimal or suboptimal results. Furthermore, we know precisely whether the out- 
come is optimal or not, and why the result may not be optimal. In fact, a 
suboptimal result is either due to the occurrence of chordless cycles in the weak 
collinearity graph, or to the occurrence of triangles that are weakly collinear but 
not collinear. 

We did not discuss the combination of parallelism and collinearity. When we 
group lines according to parallelism as well as collinearity, a natural requirement 
is that collinear groups must be subgroups of parallel groups. Also in paral- 
lel grouping we use a simplicial grouping process [12], but the precise relation 
between the two processes must still be further examined. 
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